Description

CROSS-REFERENCE TO RELATED APPLICATION 

This application is a continuation-in-part of U.S. Application Serial No. 07/147,351 filed January 22,
1988.

FIELD OF THE INVENTION 
The invention is in the field of recombinant genetics.

BACKGROUND OF THE INVENTION

The human and simian immunodeficiency viruses HIV and SIV are the causative agents of Acquired
Immune Deficiency Syndrome (AIDS) and Simian Immunodeficiency Syndrome (SIDS), respectively. See
Curran, J. et al., Science 229:1352-1357 (1985); Weiss, R. et al., Nature 324:572-575 (1986). The HIV virus
contains an envelope glycoprotein, gp120, which binds to the CD4 protein present on the surface of helper T
lymphocytes, macrophages and other cells. Dalgleish et al., Nature 312:763 (1984). After the gp120 binds to
CD4, virus entry is facilitated by an envelope-mediated fusion of the viral and target cell membranes.

During the course of infection, the host organism develops antibodies against viral proteins, including
the major envelope glycoproteins gp120 and gp41. Despite this humoral immunity, the disease progresses,
resulting in a lethal immunosuppression characterized by multiple opportunistic infections, parasitemia,
dementia and death. The failure of host anti-viral antibodies to arrest the progression of the disease
represents one of the most vexing and alarming aspects of the infection, and augurs poorly for vaccination
efforts based upon conventional approaches.

Two factors may play a role in the inefficacy of the humoral response to immunodeficiency viruses.
First, like other RNA viruses (and like retroviruses in particular), the immunodeficiency viruses show a high
mutation rate which allows antigenic variation to progress at a high rate in response to host immune
surveillance. Second, the envelope glycoproteins themselves are heavily glycosylated molecules presenting
few epitopes suitable for high affinity antibody binding. The poorly antigenic, "moving" target which the viral
envelope presents allows the host little opportunity for restricting viral infection by specific antibody
production.

Cells infected by the HIV virus express the gp120 glycoprotein on their surface. Gp120 mediates fusion
events among CD4+ cells via a reaction similar to that by which the virus enters the uninfected cell, leading
to the formation of short-lived multinucleated giant cells. Syncytium formation is dependent on a direct
interaction of the gp120 envelope glycoprotein with the CD4 protein. Dalgleish et al., supra; Klatzmann, D.
et al., Nature 312:763 (1984); McDougal, J.S. et al., Science 231:382 (1986); Sodroski, J. et al., Nature
322:470 (1986); Lifson, J.D. et al., Nature 323:725 (1986); Sodroski, J. et al., Nature 321:412 (1986).

The CD4 protein consists of a 370 amino acid extracellular region containing four immunoglobulin-like
domains, a membrane spanning domain, and a charged intracellular region of 40 amino acid residues.
Maddon, P. et al., Cell 42:93 (1985); Clark, S. et al., Proc. Natl. Acad. Sci. (USA) 84:1649 (1987).

Evidence that CD4-gp120 binding is responsible for viral infection of cells bearing the CD4 antigen
includes the finding that a specific complex is formed between gp120 and CD4. McDougal et al., supra.
Other workers have shown that cell lines which could not be infected by HIV were converted to infectable cell
lines following transfection and expression of the human CD4 cDNA gene. Maddon et al., Cell 47:333-348
(1986).

In contrast to the majority of antibody-envelope interactions, the receptor-envelope interaction is
characterized by a high affinity (Ka ≈ 10^8 l/mole) immutable association. Moreover, the affinity of the virus for
CD4 is at least 3 orders of magnitude higher than the affinity of CD4 for its putative endogenous ligand, the
MHC class II antigens. Indeed, to date, a specific physical association between monomeric CD4 and class II
antigens has not been demonstrated.
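
The affinity comparison above can be made concrete with a little arithmetic. The sketch below converts the stated association constant into a dissociation constant and estimates receptor occupancy. The simple 1:1 binding model, the function names, and the 100 nM ligand concentration are illustrative assumptions and do not come from this specification; the class II value is only the upper bound implied by "at least 3 orders of magnitude".

```python
# Illustrative arithmetic only; a simple 1:1 binding model is assumed.

K_A_GP120_CD4 = 1e8                      # l/mol, as stated in the text
K_A_CLASS_II_MAX = K_A_GP120_CD4 / 1e3   # upper bound implied by the text


def dissociation_constant(k_a):
    """Return K_d (mol/l) for an association constant K_a (l/mol)."""
    return 1.0 / k_a


def fraction_bound(ligand_conc, k_a):
    """Fraction of receptor occupied at a free ligand concentration (mol/l)."""
    return (k_a * ligand_conc) / (1.0 + k_a * ligand_conc)


if __name__ == "__main__":
    ligand = 1e-7  # 100 nM, a hypothetical concentration
    print("K_d gp120-CD4 (mol/l):", dissociation_constant(K_A_GP120_CD4))
    print("occupancy, gp120:", fraction_bound(ligand, K_A_GP120_CD4))
    print("occupancy, class II (upper bound):",
          fraction_bound(ligand, K_A_CLASS_II_MAX))
```

Under these assumptions the gp120 interaction is close to saturation at a concentration where a ligand a thousandfold weaker would occupy about one percent of the receptor.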

In response to bacterial or other particle infection, the host organism usually produces serum antibodies
that bind to specific proteins or carbohydrates on the bacterial or particle surface, coating the bacteria. This
antibody coat on the bacterium or other particle stimulates cytolysis by Fc-receptor-bearing lymphoid cells
by antibody-dependent cellular cytotoxicity (ADCC). Other serum proteins, collectively called complement (C),
bind to antibody-coated targets, and also can coat foreign particles nonspecifically. They cause cell death
by lysis, or stimulate ingestion by binding to specific receptors on the macrophage called complement
receptors. See Darnell, J. et al., in Molecular Cell Biology, Scientific American Books, pp. 641 and 1087
(1986).

The most effective complement activating classes of human Ig are IgM and IgG1. The complement
system consists of 14 proteins that, acting in order, cause lysis of cells. Nearly all of the C proteins exist in
normal serum as inactive precursors. When activated, some become highly specific proteolytic enzymes
whose substrate is the next protein in a sequential chain reaction.

The entire C sequence can be triggered by either of two initiation pathways. In one (the classic
pathway), Ab-Ag complexes bind and activate C1, C4 and C2 to form a C3-splitting enzyme. In the second
pathway, polysaccharides commonly on the surface of many bacteria and fungi bind with trace amounts of
a C3 fragment and then with two other proteins (factor B and properdin) to form another C3-splitting
enzyme. Once C3 is split by either pathway, the way is open for the remaining sequence of steps which
lead to cell lysis. See Davis, B.D., et al., in Microbiology, 3rd ed., Harper and Row, Philadelphia, PA, pp.
452-466 (1980).

A number of workers have disclosed methods for preparing hybrid proteins. For example, Murphy,
United States Patent 4,675,382 (1987), discloses the use of recombinant DNA techniques to make hybrid
protein molecules by forming the desired fused gene coding for a hybrid protein of diphtheria toxin and a
polypeptide ligand such as a hormone, followed by expression of the fused gene.

Many workers have prepared monoclonal antibodies (Mabs) by recombinant DNA techniques. Monoclonal
antibodies are highly specific well-characterized molecules in both primary and tertiary structure.
They have been widely used for in vitro immunochemical characterization and quantitation of antigens.
Genes for heavy and light chains have been introduced into appropriate hosts and expressed, followed by
reaggregation of the individual chains into functional antibody molecules (see, for example, Munro, Nature
312:597 (1984); Morrison, S.L., Science 229:1202 (1985); Oi et al., Biotechniques 4:214 (1986); Wood et al.,
Nature 314:446-449 (1985)). Light- and heavy-chain variable regions have been cloned and expressed in
foreign hosts wherein they maintained their binding ability (Moore et al., European Patent Application
0088994 (published September 21, 1983)).

Chimeric or hybrid antibodies have also been prepared by recombinant DNA techniques. Oi and 
Morrison, Biotechniques 4:214 (1986) describe a strategy for producing such chimeric antibodies which 
include a chimeric human IgG anti-leu3 antibody. 

Gascoigne, N.R.J., et al., Proc. Natl. Acad. Sci. (USA) 84:2936-2940 (1987) disclose the preparation of a
chimeric gene construct containing a T-cell receptor α-chain variable (V) domain and the constant (C) region
coding sequence of an immunoglobulin γ2a molecule. Cells transfected with the chimeric gene synthesize a
protein product that expresses immunoglobulin and T-cell receptor antigenic determinants as well as protein
A binding sites. This protein associates with a normal λ chain to form an apparently normal tetrameric
(H2L2, where H = heavy and L = light) immunoglobulin molecule that is secreted.

Sharon, J., et al., Nature 309:54 (1984), disclose construction of a chimeric gene encoding the variable
(V) region of a mouse heavy chain specific for the hapten azophenylarsonate and the constant (C) region of
a mouse kappa light chain (VHCκ). This gene was introduced into a mouse myeloma cell line. The chimeric
gene was expressed to give a protein which associated with light chains secreted from the myeloma cell
line to give an antibody molecule specific for azophenylarsonate.

Morrison, Science 229:1202 (1985), discloses that variable light- or variable heavy-chain regions can be
attached to a non-Ig sequence to create fusion proteins. This article states that the potential uses for the
fusion proteins are three: (1) to attach antibody specificity to enzymes for use in assays; (2) to isolate
non-Ig proteins by antigen columns; and (3) to specifically deliver toxic agents.

Recent techniques for the stable introduction of immunoglobulin genes into myeloma cells (Banerji, J.,
et al., Cell 33:729-740 (1983); Potter, H., et al., Proc. Natl. Acad. Sci. (USA) 81:7161-7165 (1984)), coupled
with detailed structural information, have permitted the use of in vitro DNA methods such as mutagenesis,
to generate recombinant antibodies possessing novel properties.

PCT Application WO87/02671 discloses methods for producing genetically engineered antibodies of
desired variable region specificity and constant region properties through gene cloning and expression of
light and heavy chains. The mRNA from cloned hybridoma B cell lines which produce monoclonal
antibodies of desired specificity is isolated for cDNA cloning. The generation of light and heavy chain
coding sequences is accomplished by excising the cloned variable regions and ligating them to light or
heavy chain module vectors. This gives cDNA sequences which code for immunoglobulin chains. The lack
of introns allows these cDNA sequences to be expressed in prokaryotic hosts, such as bacteria, or in lower
eukaryotic hosts, such as yeast.

The generation of chimeric antibodies in which the antigen-binding portion of the immunoglobulin is
fused to other moieties has been demonstrated. Examples of non-immunoglobulin genes fused to antibodies
include Staphylococcus aureus nuclease, the mouse oncogene c-myc, and the Klenow fragment of
E. coli DNA polymerase I (Neuberger, M.S., et al., Nature 312:604-612 (1984); Neuberger, M.S., Trends in
Biochemical Science, 347-349 (1985)). European Patent Application 120,694 discloses the genetic engineering
of the variable and constant regions of an immunoglobulin molecule that is expressed in E. coli host
cells. It is further disclosed that the immunoglobulin molecule may be synthesized by a host cell with
another peptide moiety attached to one of the constant domains. Such peptide moieties are described as
either cytotoxic or enzymatic. The application and the examples describe the use of a lambda-like chain
derived from a monoclonal antibody which binds to 4-hydroxy-3-nitrophenyl (NP) haptens.

European Patent Application 125,023 relates to the use of recombinant DNA techniques to produce
immunoglobulin molecules that are chimeric or otherwise modified. One of the uses described for these
immunoglobulin molecules is for whole-body diagnosis and treatment by injection of the antibodies directed
to specific target tissues. The presence of the disease can be determined by attaching a suitable label to
the antibodies, or the diseased tissue can be attacked by carrying a suitable drug with the antibodies. The
application describes antibodies engineered to aid the specific delivery of an agent as "altered antibodies."

PCT Application WO83/101533 describes chimeric antibodies wherein the variable region of an
immunoglobulin molecule is linked to a portion of a second protein which may comprise the active portion
of an enzyme.

Boulianne et al., Nature 312:643 (1984) constructed an immunoglobulin gene in which the DNA
segments that encode mouse variable regions specific for the hapten trinitrophenol (TNP) are joined to
segments that encode human mu and kappa regions. These chimeric genes were expressed to give
functional TNP-binding chimeric IgM.

Morrison et al., P.N.A.S. (USA) 81:6851 (1984), disclose a chimeric molecule utilizing the heavy-chain
variable region exons of an anti-phosphoryl choline myeloma protein G, which were joined to the exons of 
either human kappa light-chain gene. The genes were transfected into mouse myeloma cell lines, 
generating transformed cells that produced chimeric mouse-human IgG with antigen-binding function. 

Despite the progress that has been achieved in determining the mechanism of HIV infection, a need
continues to exist for methods of treating HIV viral infections.

SUMMARY OF THE INVENTION 

The invention relates to a gene comprising a DNA sequence which encodes a fusion protein comprising
1) CD4, or a fragment thereof which binds to HIV gp120, and 2) an immunoglobulin light or heavy chain; 
wherein said CD4 or HIV gp120-binding fragment thereof replaces the variable region of the light or heavy 
immunoglobulin chain. 

The invention also relates to vectors containing the gene of the invention and hosts transformed with the
vectors.

The invention also relates to a method of producing a fusion protein comprising CD4, or fragment 
thereof which binds to HIV gp120, and an immunoglobulin light or heavy chain, wherein the variable region 
of the immunoglobulin light or heavy chain has been substituted with CD4, or HIV gp120-binding fragment 
thereof, which comprises: 

cultivating in a nutrient medium under protein producing conditions, a host strain transformed with the
vector containing the gene of the invention, said vector further comprising expression signals which are 
recognized by said host strain and direct expression of said fusion protein, and 
recovering the fusion protein so produced. 

The invention also relates to a fusion protein comprising CD4, or fragment thereof which is capable of
binding to HIV gp120, fused at the C-terminus to a second protein which comprises an immunoglobulin light
or heavy chain, wherein the variable region of said light or heavy chain is substituted with CD4 or an HIV
gp120 binding fragment thereof.

The invention also relates to an immunoglobulin-like molecule comprising the fusion protein of the
invention together with an immunoglobulin light or heavy chain, wherein said immunoglobulin-like molecule
binds HIV gp120.

The IgG1 fusion proteins and immunoglobulin-like molecules may be useful for both complement-mediated
and cell-mediated (ADCC) immunity, while the IgM fusion proteins are useful principally through
complement-mediated immunity.

The invention also relates to a complex between the fusion proteins and immunoglobulin-like molecule
of the invention and HIV gp120.

The invention also relates to a method for treating HIV or SIV infections comprising administering the 
fusion protein or immunoglobulin-like molecule of the invention to an animal.

The invention further relates to a method for detecting HIV gp120 in a sample comprising contacting a
sample suspected of containing HIV or gp120 with the fusion protein or immunoglobulin-like molecule of the 
invention, and detecting whether a complex has formed. 

DESCRIPTION OF THE PREFERRED EMBODIMENTS

The invention is directed to a protein gene which comprises 

1) a DNA sequence which codes for CD4, or fragment thereof which binds to HIV gp120, fused to 

2) a DNA sequence which encodes an immunoglobulin heavy chain.

Preferably, the antibody has effector function.

The invention is also directed to a protein gene which comprises 

1) a DNA sequence which codes for CD4, or fragment thereof which binds to HIV gp120, fused to

2) a DNA sequence which encodes an immunoglobulin light chain; wherein said sequence which codes
for CD4, or HIV gp120-binding fragment thereof, replaces the variable region of the light immunoglobulin
chain.

The invention is also directed to the expression of these novel fusion proteins in transformed hosts and 
the use thereof to treat and diagnose HIV infections. In particular, the invention relates to expressing said 
genes in mammalian hosts which express complementary light or heavy chain immunoglobulins to give 
immunoglobulin-like molecules which have antibody effector function and also bind to HIV or SIV gp120.

The term "antibody effector function" as used herein denotes the ability to fix complement or to
activate ADCC.

The fusion proteins and immunoglobulin-like molecules may be administered to an animal for the
purpose of treating HIV or SIV infections. By the term "HIV infections" is intended the condition of having
AIDS, AIDS related complex (ARC) or where an animal harbors the AIDS virus, but does not exhibit the
clinical symptoms of AIDS or ARC. By the term "SIV infections" is intended the condition of being infected
with simian immunodeficiency virus.

By the term "animal" is intended all animals which may derive benefit from the administration of the
fusion proteins and immunoglobulin-like molecules of the invention. Foremost among such animals are
humans; however, the invention is not intended to be so limited.

By the term "fusion protein" is intended a fused protein comprising CD4, or fragment thereof which is
capable of binding to gp120, linked at its C-terminus to an immunoglobulin chain wherein a portion of the
N-terminus of the immunoglobulin is replaced with CD4. In general, that portion of immunoglobulin which is
deleted is the variable region. The fusion proteins of the invention may also comprise immunoglobulins
where more than just the variable region has been deleted and replaced with CD4 or HIV gp120 binding
fragment thereof. For example, the VH and CH1 regions of an immunoglobulin chain may be deleted.
Preferably, any amount of the N-terminus of the immunoglobulin heavy chain can be deleted as long as the
remaining fragment has antibody effector function. The minimum sequence required for binding complement
encompasses domains CH2 and CH3. Joining of Fc portions by the hinge region is advantageous
for increasing the efficiency of complement binding.
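
The domain replacement just described can be pictured as a simple list operation. The following is a minimal sketch, assuming an IgG1-like heavy chain with the domain order VH, CH1, hinge, CH2, CH3; the domain labels, function names, and the choice to model domains as strings are illustrative assumptions and do not describe the actual constructs of the Examples.

```python
# Sketch of the N-terminal replacement described above, assuming an
# IgG1-like heavy chain. Domain names are labels only, not sequences.

HEAVY_CHAIN_DOMAINS = ["VH", "CH1", "hinge", "CH2", "CH3"]
COMPLEMENT_BINDING_DOMAINS = {"CH2", "CH3"}  # minimum stated in the text


def build_fusion(n_deleted):
    """Replace the first n_deleted heavy-chain domains with CD4."""
    if not 1 <= n_deleted <= len(HEAVY_CHAIN_DOMAINS):
        raise ValueError("at least the variable region must be replaced")
    return ["CD4"] + HEAVY_CHAIN_DOMAINS[n_deleted:]


def retains_complement_binding(domains):
    """True if both CH2 and CH3 are still present in the fusion."""
    return COMPLEMENT_BINDING_DOMAINS.issubset(domains)


if __name__ == "__main__":
    for n in range(1, len(HEAVY_CHAIN_DOMAINS) + 1):
        fusion = build_fusion(n)
        print("-".join(fusion), retains_complement_binding(fusion))
```

In this toy model, deleting VH alone or VH plus CH1 leaves CH2 and CH3 intact, whereas a deletion that reaches into CH2 no longer satisfies the stated minimum for complement binding.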

The CD4 portion of the fusion protein may comprise the complete CD4 sequence, the 370 amino acid
extracellular region and the membrane spanning domain, or the extracellular region. The fusion protein may
comprise fragments of the extracellular region obtained by cutting the DNA sequence which encodes CD4
at the BspMI site at position 514 or the PvuII site at position 629 (see Table 1) to give nucleotide
sequences which encode CD4 fragments which retain binding to gp120. In general, any fragment of CD4
may be used as long as it retains binding to gp120.
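
Truncating a coding sequence at a restriction site, as described above, can be sketched in a few lines. The example below locates PvuII sites (recognition sequence CAGCTG, blunt cut between CAG and CTG), keeps the 5' portion, and translates it with the standard genetic code. The toy sequence and all function names are hypothetical; the sequence is not the CD4 sequence of Table 1, and BspMI, which cuts outside its recognition sequence, is not modeled here.

```python
from itertools import product

# Standard genetic code, built in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

PVUII_SITE = "CAGCTG"   # PvuII recognition sequence
PVUII_CUT_OFFSET = 3    # blunt cut: CAG^CTG


def cut_positions(seq, site=PVUII_SITE, offset=PVUII_CUT_OFFSET):
    """Return 0-based indices at which the enzyme would cut seq."""
    positions = []
    start = seq.find(site)
    while start != -1:
        positions.append(start + offset)
        start = seq.find(site, start + 1)
    return positions


def truncate_at_first_site(seq):
    """Keep the 5' portion of seq up to the first PvuII cut."""
    cuts = cut_positions(seq)
    if not cuts:
        raise ValueError("no PvuII site found")
    return seq[:cuts[0]]


def translate(seq):
    """Translate complete codons from position 0, stopping at a stop codon."""
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# Hypothetical toy sequence used only to exercise the functions.
TOY_SEQUENCE = "ATGAAACAGCTGGGATAA"

if __name__ == "__main__":
    fragment = truncate_at_first_site(TOY_SEQUENCE)
    print(fragment, translate(fragment))
```

Whether a fragment produced this way remains in frame with the downstream immunoglobulin sequence depends on the actual cut position and linker, which this sketch does not address.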

Where the fusion protein comprises an immunoglobulin light chain, it is necessary that no more of the 
Ig chain be deleted than is necessary to form a stable complex with a heavy chain Ig. In particular, the 
cysteine residues necessary for disulfide bond formation must be preserved on both the heavy and light 
chain moieties. 

When expressed in a host, e.g., a mammalian cell, the fusion protein may associate with other light or
heavy Ig chains secreted by the cell to give a functioning immunoglobulin-like molecule which is capable of
binding to gp120. The gp120 may be in solution, expressed on the surface of infected cells, or may be
present on the surface of the HIV virus itself. Alternatively, the fusion protein may be expressed in a
mammalian cell which does not secrete other light or heavy Ig chains. When expressed under these
conditions, the fusion protein may form a homodimer.

Genomic or cDNA sequences may be used in the practice of the invention. Genomic sequences are
expressed efficiently in myeloma cells, since they contain native promoter structures.

The constant regions of the antibody cloned and used in the chimeric immunoglobulin-like molecule
may be derived from any mammalian source. The constant regions may be complement binding or ADCC
active. However, preliminary work (see Examples) indicates that the fusion proteins of the invention may
mediate HIV or SIV infected cell death by an ADCC or complement-independent mechanism. The constant
regions may be derived from any appropriate isotype, including IgG1, IgG3, or IgM.

The joining of various DNA fragments is performed in accordance with conventional techniques,
employing blunt-ended or staggered-ended termini for ligation, restriction enzyme digestion to provide
appropriate termini, filling in of cohesive ends as appropriate, alkaline phosphatase treatment to avoid
undesirable joining, and ligation with appropriate ligases. The genetic construct may optionally encode a
leader sequence to allow efficient expression of the fusion protein. For example, the leader sequence
utilized by Maddon et al., Cell 42:93-104 (1985) for the expression of CD4 may be used.

For cDNA, the cDNA may be cloned and the resulting clone screened, for example, by use of a
complementary probe or by assay for expressed CD4 using an antibody as disclosed by Dalgleish et al.,
Nature 312:763-766 (1984); Klatzmann et al., Immunol. Today 7:291-297 (1986); McDougal et al., J.
Immunol. 135:3151-3162 (1985); and McDougal, J. et al., J. Immunol. 137:2937-2944 (1986).

To express the fusion hybrid protein, transcriptional and translational signals recognized by an
appropriate host element are necessary. Eukaryotic hosts which may be used include mammalian cells
capable of culture in vitro, particularly leukocytes, more particularly myeloma cells or other transformed or
oncogenic lymphocytes, e.g., EBV-transformed cells. Alternatively, non-mammalian cells may be employed,
such as bacteria, fungi, e.g., yeast, filamentous fungi, or the like.

Preferred hosts for fusion protein production are mammalian cells, grown in vitro in tissue culture or in
vivo in animals. Mammalian cells provide post-translational modification to immunoglobulin protein molecules
which provide for correct folding and glycosylation of appropriate sites. Mammalian cells which may
be useful as hosts include cells of fibroblast origin such as VERO or CHO-K1 or cells of lymphoid origin,
such as the hybridoma SP2/0-AG14 or the myeloma P3x63Sgh, and their derivatives. For the purpose of
preparing an immunoglobulin-like molecule, a plasmid containing a gene which encodes a heavy chain
immunoglobulin, wherein the variable region has been replaced with CD4 or fragment thereof which binds to
gp120, may be introduced, for example, into J558L myeloma cells, a mouse plasmacytoma expressing the
lambda-1 light chain but which does not express a heavy chain (see Oi et al., P.N.A.S. (USA) 80:825-829
(1983)). Other preferred hosts include COS cells, BHK cells and hepatoma cells.

The constructs may be joined together to form a single DNA segment or may be maintained as 
separate segments, by themselves or in conjunction with vectors. 

Where the fusion protein is not glycosylated, any host may be used to express the protein which is
compatible with replicon and control sequences in the expression plasmid. In general, vectors containing
replicon and control sequences which are derived from species compatible with a host cell are used in connection
with the host. The vector ordinarily carries a replicon site, as well as specific genes which are capable of
providing phenotypic selection in transformed cells. The expression of the fusion protein can also be placed
under control with other regulatory sequences which may be homologous to the organism in its untransformed
state. For example, lactose-dependent E. coli chromosomal DNA comprises a lactose or lac operon
which mediates lactose utilization by elaborating the enzyme beta-galactosidase. The lac control elements
may be obtained from bacterial phage lambda plac5, which is infective for E. coli. The lac promoter-operator
system can be induced by IPTG.

Other promoter/operator systems or portions thereof can be employed as well. For example, colicin
E1, galactose, alkaline phosphatase, tryptophan, xylose, tax, and the like can be used.

For mammalian hosts, several possible vector systems are available for expression. One class of
vectors utilizes DNA elements which are derived from animal viruses such as bovine papilloma virus,
polyoma virus, adenovirus, vaccinia virus, baculovirus, retroviruses (RSV, MMTV or MOMLV), or SV40 virus.
Cells which have stably integrated the DNA into their chromosomes may be selected by introducing one or
more markers which allow selection of transfected host cells. The marker may provide for prototrophy to an
auxotrophic host, biocide resistance, e.g., antibiotics, or heavy metals such as copper or the like. The
selectable marker gene can be either directly linked to the DNA sequences to be expressed, or introduced
into the same cell by cotransformation. Additional elements may also be needed for optimal synthesis of
mRNA. These elements may include splice signals, as well as transcriptional promoters, enhancers, and
termination signals. The cDNA expression vectors incorporating such elements include those described by
Okayama, H., Mol. Cell. Biol. 3:280 (1983) and others.

Once the vector or DNA sequence containing the constructs has been prepared for expression, the
DNA constructs may be introduced to an appropriate host. Various techniques may be employed, such as
protoplast fusion, calcium phosphate precipitation, electroporation or other conventional techniques. After
the fusion, the cells are grown in media and screened for the appropriate activity. Expression of the gene(s)
results in production of the fusion protein. This expressed fusion protein may then be subject to further
assembly to form the immunoglobulin-like molecule.

The host cells for immunoglobulin production may be immortalized cells, primarily myeloma or
lymphoma cells. These cells may be grown in appropriate nutrient medium in culture flasks or injected into
a syngeneic host, e.g., mouse or a rat, or immunodeficient host or host site, e.g., nude mouse or hamster
pouch. In particular, the cells may be introduced into the abdominal cavity of an animal to allow production
of ascites fluid which contains the immunoglobulin-like molecule. Alternatively, the cells may be injected
subcutaneously and the chimeric antibody harvested from the blood of the host. The cells may be used
in the same manner as hybridoma cells. See Diamond et al., N. Eng. J. Med. 304:1344 (1981), and Kennett,
McKearn and Bechtol (Eds.), Monoclonal Antibodies: Hybridomas: A New Dimension in Biologic Analysis,
Plenum, 1980.

The fusion proteins and immunoglobulin-like molecules of the invention may be isolated and purified in
accordance with conventional conditions, such as extraction, precipitation, chromatography, affinity
chromatography, electrophoresis or the like. For example, the IgG1 fusion proteins may be purified by
passing a solution through a column which contains immobilized protein A or protein G which selectively
binds the Fc portion of the fusion protein. See, for example, Reis, K.J., et al., J. Immunol. 132:3098-3102
(1984); PCT Application, Publication No. WO87/00329. The chimeric antibody may then be eluted by
treatment with a chaotropic salt or by elution with aqueous acetic acid (1 M).

Alternatively, the fusion proteins may be purified on anti-CD4 antibody columns, or on
anti-immunoglobulin antibody columns.

In one embodiment of the invention, cDNA sequences which encode CD4, or a fragment thereof which
binds gp120, may be ligated into an expression plasmid which codes for an antibody wherein the variable
region of the gene has been deleted. Methods for the preparation of genes which encode the heavy or light
chain constant regions of immunoglobulins are taught, for example, by Robinson, R. et al., PCT Application,
Publication No. WO87/02671.

Preferred immunoglobulin-like molecules which contain CD4, or fragments thereof, contain the constant
region of an IgM, IgG1 or IgG3 antibody which binds complement at the Fc region.

The fusion protein and immunoglobulin-like molecules of the invention may be used for the treatment of
HIV viral infections. The fusion protein complexes to gp120 which is expressed on infected cells. Although
the inventor is not bound by a particular theory, it appears that the Fc portion of the hybrid fusion protein
may bind with complement, which mediates destruction of the cell. In this manner, infected cells are
destroyed so that additional viral particle production is stopped.

For the purpose of treating HIV infections, the fusion protein or immunoglobulin-like molecule of the
invention may additionally contain a radiolabel or therapeutic agent which enhances destruction of the HIV
particle or HIV-infected cell.

Examples of radioisotopes which can be bound to the fusion protein or immunoglobulin-like molecule of
the invention for use in HIV therapy are 125I, 131I, 90Y, 67Cu, 217Bi, 211At, 212Pb, 47Sc, and 109Pd. Optionally,
a label such as boron can be used which emits α and β particles upon bombardment with neutron radiation.

For in vivo diagnosis, radionuclides may be bound to the fusion protein or immunoglobulin-like
molecule of the invention either directly or by using an intermediary functional group. An intermediary group
which is often used to bind radioisotopes, which exist as metallic cations, to antibodies is
diethylenetriaminepentaacetic acid (DTPA). Typical examples of metallic cations which are bound in this
manner are 99mTc, 123I, 111In, 131I, 97Ru, 67Cu, 67Ga, and 68Ga.

Moreover, the fusion protein and immunoglobulin-like molecule of the invention may be tagged with an
NMR imaging agent which includes paramagnetic atoms. The use of an NMR imaging agent allows the in
vivo diagnosis of the presence of and the extent of HIV infection within a patient using NMR techniques.
Elements which are particularly useful in this manner are 157Gd, 55Mn, 162Dy, 52Cr, and 56Fe.

Therapeutic agents may include, for example, bacterial toxins such as diphtheria toxin, or ricin. Methods
for producing fusion proteins comprising fragment A of diphtheria toxin are taught in U.S. Patent 4,675,382
(1987). Diphtheria toxin contains two polypeptide chains. The B chain binds the toxin to a receptor on a cell
surface. The A chain actually enters the cytoplasm and inhibits protein synthesis by inactivating elongation
factor 2, the factor that translocates ribosomes along mRNA concomitant with hydrolysis of GTP. See
Darnell, J., et al., in Molecular Cell Biology, Scientific American Books, Inc., page 662 (1986). Alternatively,
a fusion protein comprising ricin, a toxic lectin, may be prepared.

Introduction of the chimeric molecules by gene therapy may also be contemplated, for example, using 
retroviruses or other means to introduce the genetic material encoding the fusion proteins into suitable 
target tissues. In this embodiment, the target tissues having the cloned genes of the invention may then 
produce the fusion protein in vivo.

The dose ranges for the administration of the fusion protein or immunoglobulin-like molecule of the
invention are those which are large enough to produce the desired effect whereby the symptoms of HIV or
SIV infection are ameliorated. The dosage should not be so large as to cause adverse side effects, such as
unwanted cross-reactions, anaphylactic reactions, and the like. Generally, the dosage will vary with the age,
condition, sex and extent of disease in the patient, counterindications, if any, immune tolerance and other
such variables, to be adjusted by the individual physician. Dosage can vary from 0.01 mg/kg to 50 mg/kg,
preferably 0.1 mg/kg to 1.0 mg/kg, of the immunoglobulin-like molecule in one or more administrations
daily, for one or several days. The immunoglobulin-like molecule can be administered parenterally by
injection or by gradual perfusion over time. It can be administered intravenously, intraperitoneally,
intramuscularly, or subcutaneously.
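
The per-kilogram ranges stated above translate into absolute amounts by simple multiplication. The following is only an arithmetic sketch: the function name and the 70 kg body weight are illustrative assumptions, not values from this specification, and the default arguments merely restate the preferred 0.1 to 1.0 mg/kg range.

```python
def dose_range_mg(weight_kg, low_mg_per_kg=0.1, high_mg_per_kg=1.0):
    """Return (minimum, maximum) amount in mg for a given body weight.

    Defaults restate the preferred range quoted in the text (0.1-1.0 mg/kg).
    The function name and signature are hypothetical, for illustration only.
    """
    if weight_kg <= 0:
        raise ValueError("weight_kg must be positive")
    if low_mg_per_kg > high_mg_per_kg:
        raise ValueError("low_mg_per_kg must not exceed high_mg_per_kg")
    return weight_kg * low_mg_per_kg, weight_kg * high_mg_per_kg


# Assumed example weight of 70 kg (not taken from the specification).
preferred = dose_range_mg(70.0)            # preferred range, 0.1-1.0 mg/kg
broad = dose_range_mg(70.0, 0.01, 50.0)    # broad range, 0.01-50 mg/kg
print("preferred (mg):", preferred)
print("broad (mg):", broad)
```

The calculation says nothing about the other variables the text leaves to the physician (age, condition, tolerance); it only converts units.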

Preparations for parenteral administration include sterile aqueous or non-aqueous solutions, suspensions,
and emulsions. Examples of non-aqueous solvents are propylene glycol, polyethylene glycol,
vegetable oils such as olive oil, and injectable organic esters such as ethyl oleate. Aqueous carriers include
water, alcoholic/aqueous solutions, emulsions or suspensions, including saline and buffered media. Parenteral
vehicles include sodium chloride solution, Ringer's dextrose, dextrose and sodium chloride, lactated
Ringer's, or fixed oils. Intravenous vehicles include fluid and nutrient replenishers, electrolyte replenishers,
such as those based on Ringer's dextrose, and the like. Preservatives and other additives may also be
present, such as, for example, antimicrobials, antioxidants, chelating agents, inert gases and the like. See,
generally, Remington's Pharmaceutical Sciences, 16th Ed., Mack Eds., 1980.

The invention also relates to a method for preparing a medicament or pharmaceutical composition 
comprising the components of the invention, the medicament being used for therapy of HIV or SIV infection 
in animals. 

The detection and quantitation of antigenic substances in biological samples frequently utilize
immunoassay techniques. These techniques are based upon the formation of a complex between the
antigenic substance, e.g., gp120, being assayed and an antibody or antibodies in which one or the other
member of the complex may be detectably labeled. In the present invention, the immunoglobulin-like
molecule or fusion protein may be labeled with any conventional label.

Thus, the hybrid fusion protein or immunoglobulin-like molecule of the invention can also be used in an
assay for HIV or SIV viral infection in a biological sample by contacting a sample, derived from an animal
suspected of having an HIV or SIV infection, with the fusion protein or immunoglobulin-like molecule of the
invention, and detecting whether a complex with gp120, either alone or on the surface of an HIV-infected
cell, has formed.

For example, a biological sample may be treated with nitrocellulose, or other solid support which is
capable of immobilizing cells, cell particles or soluble protein. The support may then be washed with
suitable buffers followed by treatment with the fusion protein which may be detectably labeled. The solid
phase support may then be washed with the buffer a second time to remove unbound fusion protein and
the label on the fusion protein detected.

In carrying out the assay of the present invention on a sample containing gp120, the process
comprises:

a) contacting a sample suspected of containing gp120 with a solid support to effect immobilization of
gp120, or of a cell which expresses gp120 on its surface;

b) contacting said solid support with the detectably labeled immunoglobulin-like molecule or fusion
protein of the invention;

c) incubating said detectably labeled immunoglobulin-like molecule or fusion protein with said support for a
sufficient amount of time to allow the immunoglobulin-like molecule or fusion protein to bind to the immobilized
gp120 or cell which expresses gp120 on its surface;

d) separating the solid phase support from the incubation mixture obtained in step c); and

e) detecting the bound immunoglobulin-like molecule or fusion protein and thereby detecting and
quantifying gp120.

Alternatively, labeled immunoglobulin-like molecule (or fusion protein)-gp120 complex in a sample may
be separated from a reaction mixture by contacting the complex with an immobilized antibody or protein
which is specific for an immunoglobulin, e.g., protein A, protein G, anti-IgM or anti-IgG antibodies. Such
anti-immunoglobulin antibodies may be monoclonal or polyclonal. The solid support may then be washed
with suitable buffers to give an immobilized gp120-labeled immunoglobulin-like molecule-antibody complex.
The label on the fusion protein may then be detected to give a measure of endogenous gp120 and,
thereby, the presence of HIV.



This aspect of the invention relates to a method for detecting HIV or SIV viral infection in a sample
comprising

(a) contacting a sample suspected of containing gp120 with a fusion protein or immunoglobulin-like
molecule comprising CD4, or fragment thereof which binds to gp120, and the Fc portion of an
immunoglobulin chain, and

(b) detecting whether a complex is formed.

The invention also relates to a method of detecting gp120 in a sample, further comprising

(c) contacting the mixture obtained in step (a) with an Fc binding molecule, such as an antibody, protein
A, or protein G, which is immobilized on a solid phase support and is specific for the hybrid fusion
protein, to give a gp120 fusion protein-immobilized antibody complex,

(d) washing the solid phase support obtained in step (c) to remove unbound fusion protein, and

(e) detecting the label on the hybrid fusion protein.

Of course, the specific concentrations of detectably labeled immunoglobulin-like molecule (or fusion
protein) and gp120, the temperature and time of incubation, as well as other assay conditions may be
varied, depending on various factors including the concentration of gp120 in the sample, the nature of the
sample, and the like. Those skilled in the art will be able to determine operative and optimal assay
conditions for each determination by employing routine experimentation.

Other such steps as washing, stirring, shaking, filtering and the like may be added to the assays as is 
customary or necessary for the particular situation. 

One of the ways in which the immunoglobulin-like molecule or fusion protein of the present invention
can be detectably labeled is by linking the same to an enzyme. This enzyme, in turn, when later exposed to
its substrate, will react with the substrate in such a manner as to produce a chemical moiety which can be
detected as, for example, by spectrophotometric, fluorometric or visual means. Enzymes which can be
used to detectably label the immunoglobulin-like molecule or fusion protein of the present invention include,
but are not limited to, malate dehydrogenase, staphylococcal nuclease, delta-5-steroid isomerase, yeast
alcohol dehydrogenase, alpha-glycerophosphate dehydrogenase, triose phosphate isomerase, horseradish
peroxidase, alkaline phosphatase, asparaginase, glucose oxidase, beta-galactosidase, ribonuclease, urease,
catalase, glucose-6-phosphate dehydrogenase, glucoamylase and acetylcholinesterase.

The immunoglobulin-like molecule or fusion protein of the present invention may also be labeled with a
radioactive isotope which can be determined by such means as the use of a gamma counter or a
scintillation counter or by autoradiography. Isotopes which are particularly useful for the purpose of the
present invention are: 3H, 125I, 131I, 32P, 35S, 14C, 51Cr, 36Cl, 57Co, 58Co, 59Fe and 75Se.

It is also possible to label the immunoglobulin-like molecule or fusion protein with a fluorescent
compound. When the fluorescently labeled immunoglobulin-like molecule is exposed to light of the proper
wavelength, its presence can then be detected due to the fluorescence of the dye. Among the most
commonly used fluorescent labelling compounds are fluorescein isothiocyanate, rhodamine, phycoerythrin,
phycocyanin, allophycocyanin, o-phthalaldehyde and fluorescamine.

The immunoglobulin-like molecule or fusion protein of the invention can also be detectably labeled
using fluorescence emitting metals such as 152Eu, or others of the lanthanide series. These metals can be
attached to the immunoglobulin-like molecule or fusion protein using such metal chelating groups as
diethylenetriaminepentaacetic acid (DTPA) or ethylenediaminetetraacetic acid (EDTA).

The immunoglobulin-like molecule or fusion protein of the present invention also can be detectably
labeled by coupling it to a chemiluminescent compound. The presence of the chemiluminescent-tagged
immunoglobulin-like molecule or fusion protein is then determined by detecting the presence of luminescence
that arises during the course of a chemical reaction. Examples of particularly useful chemiluminescent
labeling compounds are luminol, isoluminol, theromatic acridinium ester, imidazole, acridinium salt and
oxalate ester.

Likewise, a bioluminescent compound may be used to label the immunoglobulin-like molecule or fusion
protein of the present invention. Bioluminescence is a type of chemiluminescence found in biological
systems in which a catalytic protein increases the efficiency of the chemiluminescent reaction. The
presence of a bioluminescent protein is determined by detecting the presence of luminescence. Important
bioluminescent compounds for purposes of labeling are luciferin, luciferase and aequorin.

Detection of the immunoglobulin-like molecule or fusion protein may be accomplished by a scintillation
counter, for example, if the detectable label is a radioactive gamma emitter, or by a fluorometer, for
example, if the label is a fluorescent material. In the case of an enzyme label, the detection can be
accomplished by colorimetric methods which employ a substrate for the enzyme. Detection may also be
accomplished by visual comparison of the extent of enzymatic reaction of a substrate in comparison with
similarly prepared standards.

The assay of the present invention is ideally suited for the preparation of a kit. Such a kit may comprise
a carrier means being compartmentalized to receive in close confinement therewith one or more container 
means such as vials, tubes and the like, each of said container means comprising the separate elements of 
the immunoassay. For example, there may be a container means containing a solid phase support, and 
further container means containing the detectably labeled immunoglobulin-like molecule or fusion protein in 
solution. Further container means may contain standard solutions comprising serial dilutions of analytes 
such as gp120 or fragments thereof to be detected. The standard solutions of these analytes may be used 
to prepare a standard curve with the concentration of gp120 plotted on the abscissa and the detection 
signal on the ordinate. The results obtained from a sample containing gp120 may be interpolated from such 
a plot to give the concentration of gp120. 
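
The interpolation step described above can be sketched as piecewise-linear interpolation between neighbouring standards. This is a minimal illustration under stated assumptions: the function name, the requirement that the signal rise strictly with concentration, and the example standard values are all hypothetical and are not data from this specification.

```python
from bisect import bisect_left


def interpolate_concentration(signal, standards):
    """Estimate analyte concentration from a detection signal.

    standards: iterable of (concentration, signal) pairs from serial dilutions.
    Assumes (hypothetically) that signal increases strictly with concentration.
    Uses straight-line interpolation between the two bracketing standards.
    """
    points = sorted(standards)
    if len(points) < 2:
        raise ValueError("at least two standards are required")
    concs = [c for c, _ in points]
    sigs = [s for _, s in points]
    if any(b <= a for a, b in zip(sigs, sigs[1:])):
        raise ValueError("signal must increase strictly with concentration")
    if not sigs[0] <= signal <= sigs[-1]:
        raise ValueError("signal lies outside the range of the standards")
    i = bisect_left(sigs, signal)
    if i == 0:  # signal equals the lowest standard's signal
        return concs[0]
    x0, x1 = concs[i - 1], concs[i]
    y0, y1 = sigs[i - 1], sigs[i]
    return x0 + (signal - y0) * (x1 - x0) / (y1 - y0)


# Made-up standard curve: (gp120 concentration, detection signal), arbitrary units.
example_standards = [(0.0, 0.05), (1.0, 0.15), (2.0, 0.25), (4.0, 0.45), (8.0, 0.85)]
estimate = interpolate_concentration(0.35, example_standards)
print("estimated concentration:", estimate)
```

Signals outside the range of the standards raise an error rather than being extrapolated, which mirrors the text's statement that results are interpolated from the plot.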

The immunoglobulin-like molecule or fusion protein of the present invention can also be used as a stain 
for tissue sections. For example, a labeled immunoglobulin-like molecule comprising CD4 or fragment 
thereof which binds to gp120 may be contacted with a tissue section, e.g., a brain biopsy specimen. This 
section may then be washed and the label detected. 

The following examples are illustrative of, but not limiting to, the method and composition of the present
invention. Other suitable modifications and adaptations which are obvious to those skilled in the art are within
the spirit and scope of this invention.

EXAMPLES 

Example 1: Preparation of CD4-Ig cDNA Constructs

The extracellular portion of the CD4 molecule (See Maddon, P. J., et al., Cell 42:93-104 (1985)) was
fused at three locations in a human IgG1 heavy chain constant region gene by means of a synthetic splice
donor linker molecule. To exploit the splice donor linker, a BamHI linker having the sequence
CGCGGATCCGCG was first inserted at amino acid residue 395 of the CD4 precursor sequence (nucleotide
residue 1295). A synthetic splice donor sequence

5'-GATCCCGAGGGTGAGTACTA-3'
    3'-GGCTCCCACTCATGATTCGA-5'

bounded by BamHI and HindIII complementary ends was created and fused to the HindIII site in the intron
preceding the CH1 domain, to the EspI site in the intron preceding the hinge domain, and to the BanI site
preceding the CH2 domain of the IgG1 genomic sequence. Assembly of the chimeric genes by ligation at
the BamHI site afforded molecules in which either the variable (V) region, the V + CH1 regions, or the V,
CH1 and hinge regions were replaced by CD4. In the last case, the chimeric molecule is expected to form a
monomer structure, while in the former two cases, a dimeric molecule is expected.
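
The two strands of the synthetic splice donor linker given above can be checked for base pairing and for the expected sticky ends. The sketch below assumes the upper strand is written 5' to 3' and the lower strand 3' to 5', and that each end carries a four-base 5' overhang; the helper names are illustrative only.

```python
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence written 5'->3'."""
    return seq.translate(_COMPLEMENT)[::-1]


# Strands as printed in the text.
TOP = "GATCCCGAGGGTGAGTACTA"               # upper strand, assumed 5'->3'
BOTTOM_AS_PRINTED = "GGCTCCCACTCATGATTCGA"  # lower strand, assumed 3'->5'
BOTTOM = BOTTOM_AS_PRINTED[::-1]            # lower strand rewritten 5'->3'


def describe_linker(top, bottom_5to3, overhang=4):
    """Report the 5' overhangs and whether the remaining bases pair fully."""
    return {
        "left_overhang": top[:overhang],           # unpaired 5' end of top strand
        "right_overhang": bottom_5to3[:overhang],  # unpaired 5' end of bottom strand
        "paired": top[overhang:] == reverse_complement(bottom_5to3[overhang:]),
    }


info = describe_linker(TOP, BOTTOM)
print(info)
# Ligating the top strand after the 5' "G" left by a BamHI cut restores GGATCC.
print("BamHI site restored:", ("G" + TOP).startswith("GGATCC"))
# The splice donor motif GTGAGT sits inside the duplex region.
print("contains GTGAGT:", "GTGAGT" in TOP)
```

Under these assumptions the left overhang reads GATC (compatible with BamHI ends) and the right overhang reads AGCT (compatible with HindIII ends), consistent with the description of the linker as bounded by BamHI and HindIII complementary ends.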

One such genetic construct which contains the DNA sequence which encodes CD4 linked to human IgG1
at the HindIII site upstream of the CH1 region (fusion protein CD4Hγ1) is depicted in Table 1. The plasmid
containing this genetic construct (pCD4Hγ1) has been deposited in E. coli (MC1061/P3) at the American
Type Culture Collection (ATCC) under the terms of the Budapest Treaty and given accession number
67611.

A second genetic construct which contains the DNA sequence which encodes CD4 linked to human
IgG1 at the EspI site upstream of the hinge region (fusion protein CD4Eγ1) is depicted in Table 2. The
plasmid containing this genetic construct (pCD4Eγ1) has been deposited in E. coli (MC1061/P3) at the
ATCC under the terms of the Budapest Treaty and given accession number 67610.

A third genetic construct which contains the DNA sequence which encodes CD4 linked to human IgM at
the MstII site upstream of the CH1 region (fusion protein CD4Mu) is depicted in Table 3. The plasmid
containing this genetic construct (pCD4Mu) has been deposited in E. coli (MC1061/P3) at the ATCC under
the terms of the Budapest Treaty and given accession number 67608.

A fourth genetic construct which contains the DNA sequence which encodes CD4 linked to human IgM
at the PstI site upstream of the CH2 region (fusion protein CD4Pu) is depicted in Table 4. The plasmid
containing this genetic construct (pCD4Pu) has been deposited in E. coli (MC1061/P3) at the ATCC under
the terms of the Budapest Treaty and given accession number 67609.

A fifth genetic construct which contains the DNA sequence which encodes CD4 linked to human IgG1 at
the BanI site downstream from the hinge region (fusion protein CD4Bγ1) is depicted in Table 5.

Two similar constructs were prepared from the human IgM heavy chain constant region by fusion with
the introns upstream of the µ CH1 and CH2 domains at an MstII site and a PstI site respectively. The
fusions were made by joining the PstI site of the CD4/IgG1 construct fused at the EspI site in the IgG1 gene to
the MstII and PstI sites in the IgM gene. In the first instance, this was performed by treatment of the PstI end
with T4 DNA Polymerase and the MstII end with E. coli DNA Polymerase, followed by ligation; and in the
second instance, by ligation alone.

Immunoprecipitation of the fusion proteins with a panel of monoclonal antibodies directed against CD4
epitopes showed that all of the epitopes were preserved. A specific high affinity association was demonstrated
between the chimeric molecules and HIV envelope proteins expressed on the surface of cells transfected
with an attenuated (reverse transcriptase deleted) proviral construct.
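
The tables that follow list each construct as a double-stranded sequence, 60 nucleotides per row, with the encoded amino acids beneath the coding region. A row can be checked for internal consistency by complementing the upper strand and translating it in the appropriate frame. The sketch below uses the row of Table 1 covering nucleotides 181-240; the function names are illustrative, the standard genetic code is assumed, and the frame offset of 2 is inferred from the preceding row, whose last codon spills over into this one.

```python
from itertools import product

# Standard genetic code, codons enumerated with bases in the order T, C, A, G.
_BASES = "TCAG"
_AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(_BASES, repeat=3), _AMINO)}

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def complement(seq):
    """Base-by-base complement (not reversed), as the lower strand is printed."""
    return seq.translate(_COMPLEMENT)


def translate(seq, offset=0):
    """Translate complete codons of seq starting at the given offset."""
    codons = (seq[i:i + 3] for i in range(offset, len(seq) - 2, 3))
    return "".join(CODON_TABLE[c] for c in codons)


# Row 181-240 of Table 1, copied from the table as printed.
TOP_181_240 = "AGGGAAAGAAAGTGGTGCTGGGCAAAAAAGGGGATACAGTGGAACTGACCTGTACAGCTT"
BOTTOM_181_240 = "TCCCTTTCTTTCACCACGACCCGTTTTTTCCCCTATGTCACCTTGACTGGACATGTCGAA"

print("strands complementary:", complement(TOP_181_240) == BOTTOM_181_240)
print("translation from offset 2:", translate(TOP_181_240, offset=2))
```

Only complete codons within the row are translated, so the final residue shown beneath this row in the table (whose codon continues into the next row) is not produced here.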
Table 1



FN S B 

N S B M H DHA S 

UP B N G RAU T 

4 B V L A AE9 X 

H 2 1 1 1 236 1 

/ 



GCCTGTTTGAGAAGCAGCGGGCAAGAAAGACGCAAGCCCAGAGGCCCTGCCATTTCTGTG 

_ _„„4 - -------- 4 

CGGACAAACTCTTCGTCGCCCGTTCTTTCTGCGTTCGGGTCTCCGGGACGGTAAAGACAC 



B PS S S 

DBS ADNPA D DHNA M HM HNC 

DAP VRLUU D RALU . N AN PCR 

EN1 AAAM9 E AEA9 L EL AIF 

122 22416 1 2346 1 31 211 

I I II I I I 
GGCTCAGGTCCCTACTGGCTCAGGCCCCTGCCTCCCTCGGCAAGGCCACAATGAACCGGG 



61 ♦ ♦ ♦ ♦ ♦ 120

CCGAGTCCAGGGATGACCGAGTCCGGGGACGGAGGGAGCCGTTCCGGTGTTACTTGGCCC 

M N R G - 

H F F 

I B N HH NMD 

N B U HA U N D 

F V 4 AE 4 L E 

1 l H 12 H 1 1 

GAGTCCCTTTTAGGCACTTGCTTCTGGTGCTGCAACTGGCGCTCCTCCCAGCAGCCACTC 

121 ♦ ♦ ♦ ♦ ♦ 180

CTCAGGGAAAATCCGTGAACGAAGACCACGACGTTGACCGCGAGGAGGGTCGTCGGTGAG 

V P F R H L L LVLQLALLPAATQ- 

B EE R A 

B C C S L 

V 0 0 A U 

1 K K 1 1 

AGGGAAAGAAAGTGGTGCTGGGCAAAAAAGGGGATACAGTGGAACTGACCTGTACAGCTT 

181 ♦ ♦ ♦ ♦ ♦ 240

TCCCTTTCTTTCACCACGACCCGTTTTTTCCCCTATGTCACCTTGACTGGACATGTCGAA 

GKKVVLGKKGDT VELTCTAS- 
H 

MM I 
B B N 
0 0 F 

2 2 1 
CCCAGAAGAAGAGCATACAATTCCACTGGAAAAACTCCAACCAGATAAAGATTCTGGGAA

241 ♦ ♦ ♦ ♦ ♦ 300

GGGTCTTCTTCTCGTATGTTAAGGTGACCTTTTTGAGGTTGGTCTATTTCTAAGACCCTT

QKKSI QFHWKNSNQIKILGN- 



B S F H

NBS F AA A A N H I

LAP 0 VU L U U H N

AN1 K A9 U 3 D A F

422 1 26 1 A 2 1 1

/ /



ATCAGGGCTCCTTCTTAACTAAAGGTCCATCCAAGCTGAATGATCGCGCTGACTCAAGAA 

301 ♦ ♦ ♦ ♦ ♦ 360

TAGTCCCGAGGAAGAATTGATTTCCAGGTAGGTTCGACTTACTAGCGCGACTGAGTTCTT 

QGSFLTKGPSKLNDRADSRR- 

S S H H

MANAS BA I A ID 

BVLUT CU N F N D 

0AA9Y L3 F L F E 

22461 1A 12 11 

/ / 
GAAGCCTTTGGGACCAAGGAAACTTCCCCCTGATCATCAAGAATCTTAAGATAGAAGACT 

361 ♦ ♦ ♦ ♦ ♦ ♦ 420

CTTCGGAAACCCTGGTTCCTTTGAAGGGGGACTAGTAGTTCTTAGAATTCTATCTTCTGA 



SLWDQGNFPLIIKNLKIEDS- 



S 

M M AM AM M 

B N VNUN A 

4 ° 0 L AL9L E 

2 1 2161 1 

CAGATACTTACATCTGTGAAGTGGAGGACCAGAAGGAGGAGGTGCAATTGCTAGTGTTCG 

421 ♦ ♦ ♦ ♦ ♦ ♦ 480

GTCTATGAATGTAGACACTTCACCTCCTGGTCTTCCTCCTCCACGTTAACGATCACAAGC

D TYICEVEDQKEEVRLLVFG- 
B

s s 

P T 
M Y 
1 1 

CATTGACTCCCAACTCTGACACCCACCTGCTTCAGGGGCAGAGCCTGACCCTCACCTTGG 

481 ♦ ♦ ♦ ♦ ♦ 540 

CTAACTGACGGTTGAGACTGTGGGTGGACGAAGTCCCCGTCTCGGACTGGGACTGGAACC 

LTANS DTHLLRGQSLTLTLE- 
/ /

AGAGCCCCCCTGGTAGTAGCCCCTCAGTGCAATGTAGGAGTCCAAGGGGTAAAAACATAC 

541 ♦ ♦-- ♦ ♦ 4 ♦ 600 

TCTCGGGGGGACCATCATCGGGGAGTCACGTTACATCCTCAGGTTCCCCATTTTTGTATG 

SPPGSSPSVQCRSPRGKNIQ- 
AGGGGGGGAAGACCCTCTCCGTGTCTCAGCTGGAGCTCCAGGATAGTGGCACCTGGACAT

601 4 4 4 4- — 4 — 4 660 

TCCCCCCCTTCTGGGAGAGGCACAGAGTCGACCTCGAGGTCCTATCACCGTGGACCTGTA 
GGKTLSVSQLELQDSGTWTC- 

N 

NS M NM A 

LP B HA L 

AH 0 EE U 

31 2 11 1 

GCACTGTCTTGCAGAACCAGAAGAAGGTGGAGTTCAAAATAGACATCGTGGTGCTAGCTT 

661 4 4 4 4 ♦ 4 720 

CGTGACAGAACGTCTTGGTCTTCTTCCACCTCAAGTTTTATCTGTAGCACCACGATCGAA 
TVLRNRKKVEFKIDIVVLAF- 
HS MM
AT N N 

EU L L 

31 11 

TCCAGAAGGCCTCCAGCATAGTCTATAAGAAAGAGGGGGAACAGGTGGAGTTCTCCTTCC 

721 ♦ ♦ ♦ ♦ ♦ ♦ 780 

AGGTCTTCCGGAGGTCGTATCAGATATTCTTTCTCCCCCTTGTCCACCTCAAGAGGAACG 

QKASSIVYKKEGEQVEFSFP- 

A AM 

L L N 

U U L 

1 1 1 

CACTCGCCTTTACAGTTGAAAAGCTGACGGGCAGTGGCGAGCTGTGGTCGCAGGCGGAGA 

781 — ♦ ♦ ♦ - -— ♦ 840 

CTGAGCGGAAATGTCAACTTTTCGACTGCCCGTCACCGCTCGACACCACCGTCCGCCTCT 

LAFTVEKLTGSGELWWQAER- 
P S 

H M FM A M 
P N LN U B 
H L ML 3 0 
1 1 11 A 2 
GGGCTTCCTCCTCCAAGTCTTGGATCACCTTTGACCTGAAGAACAAGGAAGTGTCTGTAA 

841 ♦ ♦ ♦ ♦ ♦ 900

CCCGAAGGAGGAGGTTCAGAACCTAGTGGAAACTGGACTTCTTGTTCCTTCACAGACATT 

ASSSKSWITFDLKNKEVSVK- 
B BS PS 

SM SCADNPAD A AH 
TA TRVRLUUD L LP 
EE NFAAAM9E U UH 
23 11224161 1 11 

AACGGGTTACCCAGGACCCTAAGCTCCAGATGGGCAAGAAGCTCCCGCTCCACCTCACCC 

901 4 4— * — 4 4 4 960 

TTGCCCAATGGGTCCTGGGATTCGAGGTCTACCCGTTCTTCGAGGGCGAGGTGGAGTGGG 

RVTQDPKLQMGKKLPLHLTL- 
BS BSS

M SC HS D M H SCAHM 

N TR AT D N P TRUAN 

L NFEU E L H NF9EL 

1 11 31 1 11 11631 

/ / / / 

TGCCCCAGGCCTTGCCTCAGTATGCTGGCTCTGGAAACCTCACCCTGGCCCTTGAAGCGA 

961 ♦ ♦ ♦ ♦ ♦ ♦ 1020 

ACGGGGTCCGGAACGGAGTCATACGACCGAGACCTTTGGAGTGGGACCGGGAACTTCGCT 

PQALPQYAGSGNLTLALEAK- 

S BS 

F SC H D A 

A TR P D L 

N NF H E U 

1 11 111 

/ 

AAACACGAAAGTTGCATCAGGAAGTGAACCTGGTGGTGATGAGAGCCACTCAGCTCCACA 

1021 ♦ ♦ ♦ ♦ ♦ 1080 

TTTGTCCTTTCAACGTAGTCCTTCACTTGGACCACCACTACTCTCGGTGAGTCGAGGTCT 

TGKLHQEVNLVVMRATQLQK- 

PS S 

M ADNNPA DF AM DE A 

N VRLLUU DA LN DS L 

L AAAAM9 EN UL EP U 

1 224416 11 11 11 1 

///// / / / 

AAAATTTGACCTGTGAGGTGTGGGGACCCACCTCCCCTAAGCTGATGCTGAGCTTGAAAC 

1081 - ♦ ♦ -- ♦ 4— ♦ 1140 

TTTTAAACTGGACACTCCACACCCCTGGGTGGAGGGGATTCGACTACGACTCGAACTTTG

NLTCEVWGPTSP. KLMLSLKL- 

M T H W DM 

N A P N DS 

L Q A L ET 

1 1 2 1 12 

/ 

TGGAGAACAAGGAGGCAAAGCTCTCGAAGCGGGAGAAGCCGGTGTGGGTGCTGAACCCTG 

1141 ♦ ♦ ♦ ♦ - 1200 

ACCTCTTGTTCCTCCGTTTCCAGAGCTTCGCCCTCTTCGGCCACACCCACGACTTCGGAC 

ENKEAKVSKREKPVWVLNPE- 
H PS H

F D M I A ADPA I 

0 D A N V VRUU N 
K E E FA AAM9 F 

1 13 11 2216 1 

/// 

AGGCGGGGATGTGGCAGTGTCTGCTGAGTGACTCGGGACAGGTCCTGCTGGAATCCAACA 

1201 ♦ ♦ ♦ ♦ 4 ♦ 1260 

TCCGCCCCTACACCGTCACAGACGACTCACTGAGCCCTGTCCAGGACGACCTTAGGTTGT 

AGMWQCLL SDSGQVLLES NI- 



S SA BHF BS H 

ANA HNCP 5GNMAANXA RSD I A 

's VLU PCRA PIUNMULHV SCD N L 

AA9 AIFL 1ADLH3A0A AAE D U 

236 2111 21211A421 111 3 1 

// II I II I I 

TCAAGGTTCTGCCCACATGGTCCACCCCGGTGCACGCGGATCCCGAGGGTGAGTACTAAG 

1261 ♦ ♦ ♦ ♦ ♦ ♦ 1320

AGTTCCAAGACGGGTGTACCAGGTGGGGCCACGTGCGCCTAGGGCTCCCACTCATGATTC 

KVLPTWSTPVHADPE 

BS B 

H HSCHSS M MDS 

p ATRATT N NDP 

H ENFEUY L LEM 

1 3 11 31 1 1 111 

CTTTCTGGGGCAGGCCAGGCCTGACCTTGGCTTTGGGGCAGGGAGGGGGCTAAGGTGAGG

1321 ♦ ♦ ♦ ♦ ♦ ♦ 1380

GAAAGACCCCGTCCGGTCCCGACTGGAACCGAAACCCCGTCCCTCCCCCGATTCCACTCC 

B A BH B P 

BASHBHHNN P SG N BS F H 

AHPHBAPAL A PI I AP L G 

NAMAEEHRA L 1A A Nl M A 

121112114 1 21 3 22 1 1 

CAGGTGGCGCCAGCAGGTGCACACCCAATGCCCATGAGCCCAGACACTGGACGCTGAACC 

1381 ♦ ♦ ♦ ♦ ♦ 1440

GTCCACCGCGGTCGTCCACGTGTGGGTTACGGGTACTCGGGTCTGTGACCTGCGACTTGG 

F BS S B SS B S FN 

N M SC DNHA H SMAAHNABSAC NS 

UN TR RLAU H TNUUALPAPLR UP

D L NF AAE9 A NL99EAAN1UF DB 

2 1 U 2436 1 11663412211 22 

/ / / // / / 

TCCCGGACAGTTAAGAACCCAGGGGCCTCTGCGCCTGCGCCCAGCTCTGTCCCACACCGC 



1441 ♦ ♦ ♦ ♦ ♦ 1500

AGCGCCTGTCAATTCTTGGGTCCCCGGAGACGCGGACCCGGGTCGAGACAGGGTGTGGCG 
F BSS BS

MS BNN NM S BMDMHNABSAA SCB 

AA ALL UN T BBRNALPAPUU TRA 

EC NAA 4L Y V0ALEAAN199 NFN 

32 134 HI 1 12213412266 111 

/ / / / //// / 

GGTCACATGGCACCACCTCTCTTGCAGCCTCCACCAAGGGCCCATCCGTCTTCCCCCTGG 

1501 — ♦ ♦ *— ♦ -♦ 1560 

CCAGTGTACCGTCGTGGAGAGAACGTCGCAGGTGGTTCCCGGCTAGCCAGAAGGGGGACC 

ASTKGPSVFPLA- 

BH B NFS BS F BS 

N M MSG MSB SNAH SC N SC 

L N NPI NPB PUUA TR U TR 

A L L1A L1V B49E NF 4 NF 

4 1 121 121 2H63 11 H 11 

/ / / 

CACCCTCCTCCAAGAGCACCTCTGGGGGCACAGCGGCCCTGGGCTGCCTGGTCAAGGACT 

1561 ♦ ♦ ♦ ♦— ♦ ♦ 1620 

GTGGGAGGAGGTTCTCGTGGAGACCCCCGTGTCGCCGGGACCCGACGGACCAGTTCCTGA 
ACTTCCCCGAACCGGTGACGGTGTCGTGGAACTCAGGCCCCCTGACCAGCGGCGTCCACA

1621 ♦ ♦ ♦ ♦ ♦ * 1680 

TGAAGGGGCTTGGCCACTGCCACAGCACCTTGAGTCCGCGGGACTGGTCGCCCCACGTGT 

FPEPVTVSWNSGALTSGVHT- 
S 

HNC 
PCR 
AIF 
211 

CCTTCCCGGCTGTCCTACAGTCCTCAGGACTCTACTCCCTCAGCAGCGTGGTGACCGTGC

1681 ♦ -4 * 4 — * ♦ 1740 

GGAAGGGCCGACAGGATGTCAGGAGTCCTGAGATGAGGGAGTCCTCGCACCACTGGCAGG 
FPAVLRSSGLYSLSSVVTVP- 

B F B B H 
SH N ASM B NSB MI 
PP U LTN A LPB A N 

1H 4 UXL N A1V E F 

21 H 111 1 421 2 1 

CCTCCAGCAGCTTGGGCACCCAGACCTACATCTGCAACGTGAATCACAAGCCCAGCAACA 

1741 -4 4 ♦ ♦ ♦ 1800 

GGAGGTCGTCGAACCCGTCGGTCTGGATGTACACGTTCCACTTAGTGTTCGGGTCGTTCT 
SSSLGTQTYICNVNHKPSNT- 
S M KM HM

T N AN PN 

Y L EL HL 

1 1 31 11 

CCAAGGTCGACAAGAAACTTGGTGACAGCCCAGCACAGGGAGGGAGGCTCTCTGCTGGAA 

1801 ♦ ♦ ♦ ♦ ♦ 4 1860 

GGTTCCACCTGTTCTTTCAACCACTCTCCGGTCGTGTCCCTCCCTCCCACAGACGACCTT 

K V D K K V 

E BS 

DE CHH F SC 

DS OHA 0 TR 

EP 4AE K NF 

11 712 1 11 

GCAGGCTCAGCGCTCCTGCCTGGACGCATCCCGGCTATGCAGCCCCAGTCCAGGGCAGCA 

1861 ♦ — ♦ ♦- ♦ -♦ 1920 

CGTCCGAGTCGCGAGGACCGACCTGCGTAGGGCCGATACGTCGGCGTCAGGTCCCGTCGT 

S S 

DBHMHNA HMNCN M MNDM 

RBABPLU PNCRL N NLDB 

AVE0HA9 ALIFA L LAEO 

2132146 21114 1 1312 

AGGCAGGCCCCGTCTGCCTCTTCACCCGGAGCCTCTGCCCGCCCCACTCATGCTCAGGGA 

1921 ♦ ♦ ♦ ♦ ♦ ♦ 1980

TCCGTCCGGGGCAGACGGAGAAGTGGGCCTCGGAGACGGGCGGGGTGAGTACGAGTCCCT 

BS P B BS 

SC F M B N S SC 

TR L A A L P TR 

NF M E N A 1 NF 

11 1 114 2 11 

/ / 
GAGGGTCTTCTGGCTTTTTCCCAGGCTCTGGGCAGGCACAGGCTAGGTGCCCCTAACCCA

1981 ♦ 4 ♦ *— * 2040 

CTCCCAGAAGACCGAAAAAGGGTCCGAGACCCGTCCGTGTCCGATCCACGGGGATTGGGT 

S B B 

DHA S DBS 

RAU P DAP 

AE9 M EN1 

236 1 122 

/ / 
GGCCCTGCACACAAAGGGGCAGGTGCTGGGCTCAGACCTGCCAAGAGCCATATCCGGGAG 

2041 — ♦ ♦ ♦ ♦ ♦ ♦ 2100 

CCGGGACGTGTGTTTCCCCGTCCACGACCCGAGTCTGGACGGTTCTCGGTATAGGCCCTC 
PS

DNPA D H DAM 

RLUU D A D L N 

AAM9 E E E U L 

2416 1 3 111 

GACCCTGCCCCTGACCTAAGCCCACCCCAAAGGCCAAACTCTCCACTCCCTCAGCTCGGA 

2101 ♦ ♦ * ♦ ♦ ♦ 2160 

CTGGGACGGGGACTGGATTCGGGTGGGGTTTCCGGTTTGAGACGTGAGGGAGTCGAGCCT 

H B 

I M MM P BS 

N N AB S AP 

F L ED T Nl 

1 1 32 1 22 

/ / 
CACCTTCTCTCCTCCCAGATTCCAGTAACTCCCAATCTTCTCTCTGCAGAGCCCAAATCT 

2161 ♦ - « ♦ ♦ — 2220 

GTGCAAGAGAGGAGGGTCTAAGGTCATTGAGGGTTAGAAGAGAGACGTCTCGGGTTTAGA 

E P K S - 

N BBS BS 

M NS SSC SC HS M 

A LP PTR TR AT N 

E AH INF NF EU L 

3 31 211 11 31 1 

/ / II 

TGTGACAAAACTCACACATGCCCACCGTGCCCAGCTAAGCCAGCCCAGGCCTCGCCCTCC 

2221 * ♦ ♦- ♦ ♦ ♦ 2280 

ACACTCTTTTGAGTGTGTACGGGTGGCACGGGTCCATTCGGTCGGGTCCGGAGCGGGAGG 

CDKTHTC PPCP 

B BS S S S 

AM B N SM F SC F DHNA HNC 

L N A L PA 0 TR A RALU PCR 

U L N A IE K NF N AEA9 AIF 

, , 1 4 21 1 11 1 2346 211 

III 
AGCTCAAGGCGGGACAGGTGCCCTAGAGTAGCCTGCATCCAGGGACAGGCCCCAGCCGGG 

2281 ♦ ♦ ♦ ♦ ♦ 2340

TCGAGTTCCGCCCTCTCCACGGGATCTCATCGGACGTAGGTCCCTGTCCGGGGTCGGCCC

BS S 

A M M M D M SC M ANA M 

fl B NO N TR BVLUB 

LEO L E L NF 0 AA9 0 

3 2 2 1 1 1 11 2 246 2 

/ / 

TGCTCACACGTCCACCTCCATCTCTTCCTCAGCACCTGAACTCCTCCCGCGACCCTCACT 

2341 ♦ ♦ — ♦ ♦ ♦ ♦ 2400 

ACGACTGTGCAGGTGGAGGTAGAGAAGGAGTCGTGGACTTGAGGACCCCCCTGGCAGTCA 

APELLGGPSV- 
S SS

MS AN M HMANNAC DM M 

NT UL N PNVCLUR DS A 

L Y 3A L ALAIA9F ET E 

11 A3 1 2121461 12 3 

CTTCCTCTTCCCCCCAAAACCCAAGGACACCCTCATGATCTCCCGGACCCCTCAGGTCAC 
2401 ♦ ♦ ♦ 4 4 ♦ 2460 

CAAGGAGAAGGGGGGTTTTGGGTTCCTGTGGGAGTACTAGAGGCCCTCGGGACTCCAGTG 
FLFPPKPKDTLMISRTPEVT- 
N 

NS M M DM M RM M 

LP A N DS B SA N 

AH E L ET 0 AE L 

31 2 1 12 2 12 1 

/ / 
ATGCGTGGTGGTGGACGTGAGCCACGAAGACCCTGAGGTCAAGTTCAACTGGTACGTGGA 

2461 4 4 4 4 4 . 4 2520 

TACGCACCACCACCTGCACTCGCTCCTTCTGGGACTCCAGTTCAAGTTGACCATGCACCT 
CGGCGTGGAGGTGCATAATGCCAAGACAAAGCCGCGGGAGGAGCAGTACAACAGCACGTA

2521 — —4 — ♦- ♦ 4 2580 

GCCGCACCTCCACGTATTACGGTTCTGTTTCGGCGCCCTCCTCGTCATGTTGTCGTGCAT

GVEVHNAKTKPREEQYNSTY- 

S BS 

HNC HH M SC R 

PCR GP N TR S 

AIF AH L NF A 

211 11 1 11 1 

/ / 
CCGGGTGGTCAGCGTCCTCACCGTCCTCCACCAGGACTGGCTGAATGGCAAGGAGTACAA 

2581 4 4 4 4 — 4 ♦ 2640 

GGCCCACCAGTCGCAGCAGTGGCAGGACGTGGTCCTGACCGACTTACCGTTCCTCATGTT 

RVVSVLTVLHQDWLNGKEYK- 

M T 
N A 
L Q 
1 1 

GTCCAAGGTCTCCAACAAAGCCCTCCCAGCCCCCATCGAGAAAACCATCTCCAAAGCCAA 

2641 4 4 — — — -4 — — 2700 

CACGTTCCAGAGGTTGTTTCGGGAGGGTCGGGGGTAGCTCTTTTGGTAGAGGTTTCGGTT 
CKVSNKALPAPIEK TISKAK- 
P S S S

ADNNPMA A H M N HHN BSAH 

VRLLUNU U A N L APA GFUA 

AAAAML9 9EL A EAE LI9E 

2244116 6 3 1 3 321 1163 
//// / / 

AGGTGGGACCCGTGGGGTGCGAGGGCCACATGGACAGAGGCCGGCTCGGCCCACCCTCTG 
2701 ♦ 4 ♦ ♦ _ 4 4 2760 

TCCACCCTGGGCACCCCACGCTCCCGGTGTACCTCTCTCCGGCCGAGCCGGGTGGGAGAC 
CCCTGAGAGTGACCGCTGTACCAACCTCTGTCCTACAGGGCAGCCCCGAGAACCACAGGT
2761 4 ♦ ♦ 4 4 4 2820 

GGGACTCTCACTGGCGACATGGTTGGAGACAGGATGTCCCGTCCGGGCTCTTGGTGTCCA 

GQPREPQ V- 

SS BS BS 

R F AHNNCCS A F SC SC 

S 0 VPCCRRM L 0 TR TR 

A K AAIIFFA U K NF NF 

1 1 • 1211111 1 1 11 ii 

///// / / 

GTACACCCTGCCCCCATCCCGGGATCACCTGACCAAGAACCAGGTCAGCCTGACCTGCCT

2821 4 -4 4 4 4 4 2880 

CATGTGGGACGGGGGTAGGGCCCTACTCGACTGGTTCTTGGTCCAGTCGGACTGGACGGA 
YTIPPSRDELTKNQVSLTCL- 

B F 
S N H 

P UP 
M. 4 A 

1 H 2 

GGTCAAAGGCTTCTATCCCAGCGACATCGCCGTGGAGTGGGAGAGCAATGGGCAGCCGGA 

2881 ♦ ♦ ♦ ♦ ♦ 2940

CCAGTTTCCGAAGATAGGGTCGCTGTAGCGGCACCTCACCCTCTCGTTACCCGTCGGCCT 
GAACAACTACAAGACCACGCCTCCCGTGCTGGACTCCGACGGCTCCTTCTTCCTCTACAC

2941 4 4 4 — - 4 4 4 3000 

CTTGTTGATGTTCTGGTGCGGACCCCACGACCTGAGGCTGCCGAGGAAGAAGGAGATGTC 
NNYKTTPPVLDSDCSFFLYS- 



B F S

MA S NM MBX NF M 

N L P UB ABM LA N 

L U M 40 EVN AN L 

11 1 H2 211 31 1 

/ 

CAAGCTCACCGTGGACAAGAGCAGGTGGCAGCAGGGGAACGTCTTCTCATGCTCCGTGAT 

3001 ♦ ♦ ♦ ♦ ♦ ♦ 3060 

GTTCGAGTGGCACCTGTTCTCGTCCACCGTCGTCCCCTTGCAGAAGAGTACGAGGCACTA 

KLTVDKSRWQQGNVFSC.SVM- 

S 

U n M M HNC 

S L B N PCR 

I A 0 L AIF 

13 2 1 211 

/ 

GCATGAGGCTCTGCACAACCACTACACGCAGAAGAGCCTCTCCCTGTCTCCGGGTAAATG 

3061 * * — ♦ ♦ ♦ 3120 

CGTACTCCGAGACGTGTTCGTGATGTGCGTCTTCTCGGAGAGGGACAGAGGCCCATTTAC 



HEALHNHYTQKSLSLSPCK 



CXHHN 
FMAPA 
RAEAE 
13321 
/ 

AGTGCGACGGCCG 

3121 ♦ 3133

TCACGCTGCCGGC
FN S B

N S B M H DHA S 

UP B N G RAU T 

4 B V L A AE9 X 

H 2 11 1 236 1 

GCCTGTTTGAGAAGCAGCGGGCAAGAAAGACGCAAGCCCAGAGGCCCTGCCATTTCTGTG

- 4 4 4 ♦ ♦ 60 

CGGACAAACTCTTCGTCGCCCGTTCTTTCTGCGTTCGGGTCTCCGGGACGGTAAAGACAC 

B PS S s 

DBS ADNPA D DHNA M HM. HNC 

DAP VRLUU D RALU N AN PCR 

EN1 AAAM9 E AEA9 L EL AIF 

122 22416 1 2346 1 31 211 

GGCTCAGGTCCCTACTGGCTCAGGCCCCTGCCTCCCTCGGCAAGGCCACAATGAACCGGG 
61 CCGAGTCCACGGATGACCGAGTCCGGGGACGGAGGGAGCCGTTCCGGTGTTACTTGGCCC 

M N R G - 

H F F 

t B N HH NMD 

t B U HA UN D 

c y 4 AE 4 L E 

1 i H 12 H 1 1 

GAGTCCCTTTTAGGCACTTGCTTCTGGTGCTGCAACTGGCGCTCCTCCCAGCAGCCACTC 

121 ♦ ♦ ♦ ♦ ♦ 180

CTCAGGGAAAATCCGTGAACCAAGACCACGACGTTGACCCCGAGGAGGGTCGTCGGTGAG 
VPFRHLLLVLQLALLPAATQ- 



B EE 
B C C 
V 0 0 



R A 
S L 
A U 



IKK 1 1 

AGGGAAAGAAAGTGGTCCTGGCCAAAAAAGGGGATACAGTGGAACTGACCTGTACAGCTT 

181 ♦ ♦ ♦ ♦ ♦ 240

TCCCTTTCTTTCACCACGACCCGTTTTTTCCCCTATGTCACCTTGACTGGACATGTCGAA 

GKKVVLGKKGDTVELTCTAS- 

H 

M M I 

B B N 

0 0 F 

2 2 1 
CCCAGAAGAAGAGCATACAATTCCACTGGAAAAACTCCAACCAGATAAAGATTCTGGGAA

241 ♦ ♦ ♦ ♦ ♦ 300

GGGTCTTCTTCTCGTATGTTAAGGTGACCTTTTTGAGGTTGGTCTATTTCTAAGACCCTT

QKKSI.QFHWKNSNQIKILGN- 
B S

NBS F AA 

LAP 0 VU 

AN1 K A9 

422 1 26 

/ / 
ATCAGGGCTCCTTCTTAACTAAAGGTCCATCCAAGCTCAATGATCGCGCTGACTCAAGAA 

301 - ♦ •» ♦ ♦ ♦ ♦ 360 

TAGTCCCGAGGAAGAATTGATTTCCAGGTAGGTrCGACTTACTAGCGCGACTGAGTTCTT 

QGSFLTKGPSKLNDRADSRR- 
GAAGCCTTTGGGACCAAGGAAACTTCCCCCTCATCATCAACAATCTTAAGATAGAAGACT

361 4 4 4 4 4 4 420 

CTTCGGAAACCCTGGTTCCTTTGAAGGGGGACTAGTAGTTCTTAGAATTCTATCTTCTGA 
SL WDQGNFPLI I KNLKI EDS- 

S 

M M AMAM M 

B N VNUN A 

0 L AL9L E 

2 1 2161 1 

CAGATACTTACATCTGTCAAGTGGACGACCAGAAGGAGGAGCTGCAATTGCTAGTGTTCG 

421 4 — -4 4 4 4 -4 480 

GTCTATGAATGTAGACACTTCACCTCCTGGTCTTCCTCCTCCACGTTAACGATCACAAGC 
DTYICEVE DQKEEVQLLVFG- 

B 

S S 
P T 
M Y 
1 1 

GATTGACTGCCAACTCTGACACCCACCTGCTTCAGGGCCAGAGCCTGACCCTGACCTTGC 
481 « •» ♦ 4- — * 540 

CTAACTGACGGTTGAGACTGTGGGTGGACGAAGTCCCCGTCTCGGAf.TGGGACTGGAACC 

LTANSDTHLLQGQSLTLTLE- 

B BS H 

BS SC D MIS 

AP 7R D N NT 

Nl NF E L F Y 

22 11 1 111 

/ / 

AGAGCCCCCCTGGTAGTAGCCCCTCAGTGCAATGTAGGAGTCCAAGGGGTAAAAACATAC 

541 4 4 4 4 4 ♦ 600 

TCTCGGGGGGACCATCATCGGGGACTCACGTTACATCCTCAGGTTCCCCATTTTTCTATG 
SPPCSSPSVQCRSPRGKNIQ- 
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N BBH S B BS 

M MD ASP A B5SGSC S B N SC 

B ND LPV L APTIAR T A L TR 

0 LE UBU U N1NACF X N A NF 

2 11 122 1 221111 1 1 A 11 

ACGGGGGCAAGACCCTCTCC3TCTCTCACCTCGAGCTCCACCATACTGGCACCTCGACAT 

601 _ „ 4 ♦ ♦ 660 

7CCCCCCCT7CTGGGAGAGGCACAGAGTCGACCTCGAGGTCCTATCACCGTGGACCTGTA 
GGKTLSVSQLELQOSGTWTC- 

NS M m A 

LP B HA L 

AH 0 EE U 

GCACTGTCTTGCAGAACCAGAAGAAGGTGGAGTTCAAAATAGACATCGTGGTGCTAGCTT ^ 

661 CGTGACAGAACGTCTTGGTCTr 

TV LQNRKKVEFKIDIVVLAF- 

HS MM 
AT N N 

EU L L 

31 1 1 

TCCAGAAGGCCTCCAGCATAGTCTATAAGAAAGAGGGGGAACAGGTGGAGTTCTCCTTCC ^ 

AGGTCTTCCGGAGGTCGTATCAGATATTCTT7CTCCCCCTTGTCCACCTCAAGAGCAAGG 

QKA5SIVYKKEGEQVEFSFP- 

A AM 
L L N 

U U L 

1 1 1 

CACTCGCCTTTACAGTTGAAAAGCTGACGGGCAGTGGCGAGCTGTGGTGGCACGCGGAGA 

76 i — ♦ ♦ ♦ 840 

' GTGAGCGGAAATGTCAACTTTTCCACTCCCCGTCACCGCTCGACACCACCGTCCGCCTCT 

LAFTVEKLTG SGELWWQAER- 
P S 

H M FM A M 

P N LN U B 

H L ML 3 0 

1 1 11 A 2 
GGGCTTCCTCCTCCAAGTCTTGGATCACCTTTGACCTGAAGAACAAGGAAGTGTCTGTAA 

B41 .._ * 4- ♦ - — ♦ ♦ 9°° 

CCCGAAGGAGGAGGTTCAGAACCTAGTGGAAACTGGACTTCTTGTTCCTTCACAGACATT 

ASSSKSWITFDLKNKEVSVK- 
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B BS PS 

SM SCADNPAD A AH 

TA TRVRLUUD L LP 

EE NFAAAM9E U U H 

23 11224161 1 11 

AACGGGTTACCCAGGACCCTAAGCTCCAGATGCGCAAGAAGCTCCCGCTCCACCTCACCC 

901 ♦ ♦ ♦ ♦ ♦ ♦ 960 

TTGCCCAATGGGTCCTGGGATTCGAGGTCTACCCGTTCTTCGAGGGCGAGGTGGAGTGGG 

H L T L - 
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TGCCCCAGGCCTTGCCTCAGTATGCTGGCTCTGGAAACCTCACCCTGGCCCTTGAAGCGA 

961 — ♦ ♦ ♦ -♦— ♦ 1020 

ACGGGGTCCGGAACGGAGTCATACGACCGAGACCTTTGGAGTGGGACCGGGAACTTCGCT 
PQALPQYAGSCNLTLALEAK- 

S BS 

F SC H D A 

A TR P D L 

N NF HE U 

1 11 111 

/ 

AAACACGAAACTTCCATCAGGAAGTGAACCTGC7GGTCATCAGACCCACTCACCTCCAGA 

1021 — •» ♦ ♦ + 1080 

TTTGTCCTTTCAACGTAGTCCnCACTTGGACCACCACTACTCTCGGTGAGTCGAGGTCT 
TGKLHQEVNLVVMRATQLUK- 

PS S 

W ADNNPA DF AM DE A 

N VRLLUU DA LN OS L 

L AAAAM9 EN UL EP U 

1 224416 11 11 11 1 

///// / / / 

AAAATTTGACCTGTGAGGTGTGGGGACCCACCTCCCCTAAGCTGATGCTGAGCTTGAAAC 

1081 ♦ *-- ♦ ♦ ^--* ♦ 1140 

TTTTAAACTGGACACTCCACACCCCTGGGTGGAGGGGATTCGACTACGACTCGAACTTTG 

NLT C EVWGPTSPKLMLSLKL- 

M T H M DM 

N A P N DS 

L q A L ET 

1 1 2 1 12 

/ 

TGGAGAACAAGGAGGCAAAGGTCTCGAAGCGGGAGAAGCCGGTGTGGGTGCTGAACCCTG 

1141 * •» ♦ ♦ * ♦ 1200 

ACCTCTTGTTCCTCCGTTTCCAGAGCTTCGCCCTCTTCGGCCACACCCACGACTTGGGAC 

ENKEAKVSKREKPVWVLNPE- 
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H PS H 

F D MIA ADPA I 

0 D A N V VRUU N 
K E E F A AAM9 F 

1 13 11 2216 1 

AGGCGGGGATGTGGCAGTGTCTGCTGAGTGACTCGGGACAGGTCCTGCTGGAATCCAACA 

1201 — ♦ * 4 4- * 1260 

TCCGCCCCTACACCGTCACAGACGACTCACTGAGCCCTGTCCAGGACGACCTTAGGTTGT 

AGMWQCLLSDSGQVLL ESNI- 

S SA BHF BS H 

ANA HNCP SGNMAANXA RSD I A 

VLU PCRA PIUNMULHV SCD N L 

AA9 AIFL 1ADLH3A0A AAE D U 

236 2111 21211A421 111 3 1 

// ////// / 

TCAAGGTTCTGCCCACATGGTCCACCCCGGTGCACCCGGATCCCGAGGGTGAGTACTAAG 

1261 4 4 4 -4 4 ♦ 1320 

AGT7CCAAGACGGGTGTACCAGGTGGGGCCACGTGCGCCTAGGGCTCCCACTCATGATTC 

D P E 

BS F 
BSC N 
BTR U 
VNF 4 
111 H 

CTTGAGCGCTCCTGCCTGGACGCATCCCGGCTATGCAGCCCCAGTCCAGGGCAGCAAGGC 
1321 » * «. «. 4 J35Q 

GAAGTCGCGAGGACGGACCTGCGTAGGGCCGATACGTCGGGGTCAGGTCCCGTCGTTCCG 

S S 
DBHVHNA Hi/NCN M MNDM 

RBA5PLU PNCRL N NLDB 

AVE0HA9 AlIFA L LAEO 

2132146 21114 1 1312 

// // // 
AGGCCCCGTCTGCCTCTTCACCCGGAGCCTCTGCCCGCCCCACTCATGCTCAGGGAGAGG 

I 381 4 ♦ 4 4 1440 

TCCGGGGCAGACGGAGAAGTGGGCCTCGGAGACGGGCGGGGTGAGTACGAGTCCCTCTCC 

BS P B BS S 

SC F M B N S SCOHA 

TR L A A L P TRRAU 

NF M E N A 1 NFAE9 

11 1 114 2 11236 

/ / / 

GTCTTCTGGCTTTTTCCCAGGCTCTGGGCAGGCACAGGCTAGGTGCCCCTAACCCAGGCC 

1 44 1 ♦ '— » * * ♦ 4 1500 

CAGAACACCGAAAAACGGTCCCAGACCCGTCCGTGTCCGATCCACGGCGATTCGGTCCGG 
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CTGCACACAAAGGGGCAGGTCCTGGGCTCAGACCTCCCAACAGCCATATCCGGGAGGACC 

1501 ♦ ♦ ♦ ♦ ♦ ♦ 1560 

GACGTGTGTTTCCCCGTCCACGACCCGAGTCTGGACGGTTCTCGGTATAGGCCCTCCTGG 

D H DAM 

D A D L N 

E E E U L 

1 3 111 

CTGCCCCTGACCTAAGCCCACCCCAAACGCCAAACTCTCCACTCCCTCAGCTCGGACACC 

1561 * ♦ — ♦ ♦ 1620 

GACGGGGACTGGATTCGGGTGGGGTTTCCGGTTTGAGAGGTGAGGGAGTCGAGCCTGTGG 
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nCTCTCCTCCCACATTCCAGTAACTCCCAATCnCTCTCTGCAGAGCCCAAATCTTGTG 

1621 — - - * * * 168C 

AAGAGAGCAGGCTCTAACGTCATTGAGGGTTAGAAGAGACACGTCTCGGGTTTAGAACAC 

E P K S C D - 

N BBS BS 

NS SSC SC HS MA 

LP PTR TR AT N L 

AH IN- NF EU LU 

31 211 11 31 11 

/ / / / 

acaaaa:tcacacatgccca::gtgcccaggtaagccagcccaggcctcgccctccagct 

1661 ♦ ♦ ♦ ♦ 1740 

tgttttgagtgtgtacgggtggcacgggtccattcggtcgggtccggagcgggaggtcga 
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caaggcgcgacacgtgccctagactaccctgcatccacccacaggccccagccgcctcct 

1741 « - ♦ ♦ * ♦ 1800 

gttccgccctgtccacgggatctcatccgacgtaggtccctgtccggggtcggcccacga 
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BS S 

AM M M D M SC M ANA M 

FAB N D N TR B VLU B 

LEO L E L NF 0 AA9 0 

3 2 2 1 1 1 11 2 246 2 

GACACGTCCACCTCCATCTCTTCCTCAGCACCTGAACTCCTGGGGGGACCGTCAGTCnC 

j 80i 4 ♦ ♦ ♦ 4 1860 

CTGTCCAGGTGGACGTAGAGAAGGAGTCGTGGACTTGAGGACCCCCCTGGCAGTCAGAAG 

APELLGGPSVF - 

S SS N 

MS AN M HMANNAC DM M N5 

NT UL N PNVCLUR DS A LP 

L Y 3A L ALAIA9F ET E AH 

11 A3 1 2121461 12 3 31 

/ / // / / 
CTCTTCCCCCCAAAACCCAAGGACACCCTCATGATCTCCCGGACCCCTGAGGTCACATGC 
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GTGGTGGTGGACGTGAGCCACGAAGACCCTGAGGTCAAGTTCAACTGGTACGTGGACGGC 
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CACCACCACCTGCACTCGGTGCTTCTGGGACTCCAGTTCAAGTTGACCATGCACCTGCCG 
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M T 
N A 
L Q 
1 1 
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CTCTCACTGGCGACATGGTTGGACACAGGATGTCCCGTCGGGGCTCTTGGTGTCCACATG 

GQPREPQVY - 



EP 0 325 262 B1 



SS BS BS B 

AHNN'CCS A F SC SC S 

. VPCCRRV L 0 TR TR P 

AAUFFA U K NF NF M 

1211111 1 1 11 11 1 

ACCCTGCCCCCATCCCGGGATGAGCTGACCAAGAACCAGGTCAGCCTGACCTCCCTGGTC 
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TGGGACGGGGGTAGGGCCCTACTCGACTGGTTCTTGGTCCAGTCGGACTGGACGGACCAG 
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AAAGGCTTCTATCCCAGCGACATCGCCGTGGAGTGGGAGAGCAATGGGCAGCCGGAGAAC 
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TTTCCGAAGATACGGTCGCTGTAGCGGCACCTCACCCTCTCGTTACCCGTCGGCCTCTTG 
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CTCACCGTGGACAAGAGCAGGTGGCAGCAGGGGAACGTCTTCTCATGCTCCGTGATGCAT 

2461 ———4 4 4. T 4 4 4 2520 

GAGTGGCACCTGTTCTCGTCCACCGTCGTCCCCTTGCAGAAGAGTACGAGGCACTACGTA 
LTVDKSRWQQGNVFSCSVMH - 
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CAGGCTCTGCACAACCACTACACGCAGAAGAGCCTCTCCCTCTCTCCGGCTAAATGAGTG 

2521 ♦ * * * *" *"*" * 



Table 3 






F N 








S 


B 


N S 


B 


If. 


H 


DHA 


S 


U P 


B 


H 


G 


RAU 


T 


4 B 


V 


L 


A 


AE9 


X 


H 2 


1 


1 


1 


236 


1 



45 



I 

GCCTGTTTGAGAACCAGCGGGCAAGAAAGACCCAAGCCCAGACGCCCTGCCATT7CTGTG 

1 — 4 4 4 4 4 4 60 

CGGACAAACTCTTCGTCGCCCGTTCTTTCTGCGTTCGGGTCTCCGGGACGGTAAAGACAC 

B PS S S 

DBS ADNPA D DHNA M HM HNC 

DAP VRLUU D RALU N AN PCR 

EN1 AAAV.9 E AEA9 L EL AIF 

122 22416 1 2346 1 31 211 

/ / // / / / 

GGCTCACGTCCCTACTGGCTCAGGCCCCTGCCTCCCTCGCCAAGGCCACAATGAACCGGG 

61 4 ♦ 4 4 4 ♦ 120 

CCGAGTCCAGGGATGACCGAGTCCGGGGACGGAGGGAGCCGTTCCGGTGTTACTTGGCCC 

M N R G - 

H F F 

I B N HH NMD 

N B U HA U N D 

F V 4 AE 4 L E 

1 1 H 12 H 1 1 

GAGTCCCTTTTAGGCACTTGCTTCTGGTGCTGCAACTGGCGCTCCTCCCAGCAGCCACTC 

121 " . 4 4- 4 4 4- 4 180 

CTCAGGGAAAATCCGTGAACGAAGACCACGACGTTGACCGCGAGGAGGGTCGTCGGTGAG 

VPFRHLLLVLQLALLPAATQ- 
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B C C S L 

V 0 0 A U 
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AGGCAAAGAAAGTCGTGCTGGGCAAAAAAGCGCATACACTGGAACTGACCTGTACAGCTT 
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TCCCTTTCTTTCACCACGACCCGI 1 1 1 1 1 CCCCTATGTCACCTTGACTGGACATGTCGAA 
GKKVVLGKKGDTVELTCTAS- 
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CCCAGAAGAAGAGCATACAATTCCACTGGAAA.AACTCCAACCAGATAAAGATTCTGGGAA 
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GGGTCTTCTTCTCGTATGTTAAGGTGACCTTTTTGAGGTTGGTCTATTTCTAAGACCCTT 

QKKSIQFHWKNSNQIKILCN- 

B S S F H 

NBS F AA A A N H I 

LAP 0 VU L U UH N 

AN1 K A9 U 3 D A F 

422 1 26 1 A 2 1 1 

ATCAGGGCTCCTTCTTAACTAAAGGTCCATCCAAGCTGAATGATCGCGCTGACTCAAGAA 

20 3oi 4 —4 * 4 * 4 360 

TAGTCCCGAGGAAGAATTGATTTCCAGGTAGGTTCGACTTACTAGCGCGACTGAGTTCTT 

QGSFLTKCPSKLNDRADSRR- 

25 s S H H 

MANAS BA I A ID 

BVLUT CU N F N D 

0AA9Y L3 F L F E 

22461 1A 12 11 

GAAGCCTTTGGGACCAAGGAAACTTCCCCCTGATCATCAAGAATCTTAAGATAGAAGACT 
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CTTCGGAAACCCTGGT7CCTTTGAAGGGGGACTAGTAGTTCTTAGAATTCTATCTTCTGA 

WDQGNFPLI IKNLKIED. S- 
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CAGATACTTACATCTGTGAAGTGGAGGACCAGAAGGAGGAGGTGCAATTGCTAGTGTTCG 
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45 GTCTATGAATGTAGACACTTCACCTCCTGGTCTTCCTCCTCCACGTTAACGATCACAAGC 

DTYICEVEDQKEEVQLLVFC- 
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1 1 

GATTGACTGCCAACTCTGACACCCACCTGCTTCAGGGGCAGAGCCTGACCCTGACCTTGG 

CTAACTGACGGTTGAGACTGTGGGTGGACGAAGTCCCCGTCTCGGACTGGGACTGGAACC 

LTANSDTHLL QGQSLTlTLE- 

B BS H 

BS SC D V I S 

AP TR D N NT 

Nl NF E L FY 

22 11 1 111 

AGAGCCCCCCTGGTAGTAGCCCCTCAGTGCAATGTAGGAGTCCAAGGGGTAAAAACATAC 

541 4 * * — r ♦ ♦ 600 

TCTCGGGGGGACCATCATCGGGGAGTCACGTTACATCCTCAGGTTCCCCATTTTTGTATG 

SPPGS SPSVQCRSPRGKNIQ- 

N BBH S B BS 

U MD ASP A BSSGSC S B N SC 

B ND LPV L APTIAR T A L TR 

0 LE UBU U N1NACF X N A NF 

2 11 122 1 221111 1 1 4 11 

AGGGGGGGAAGACCCTCTCCGTGTCTCAGCTGGAGCTCCAGGATAGTGGCACCTGGACAT 

601 * 4 4 "4 4 4 660 

TCCCCCCCnCTGGGAGAGGCACAGAGTCGACCTCGAGGTCCTATCACCGTGGACCTGTA 
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GCACTGTCTTGCAGAACCAGAAGAAGGTGGAGTTCAAAATAGACATCGTGGTGCTAGCTT 

661 4 ♦ — ♦ ♦ ♦ — -------- 720 

CGTGACAGAACGTCTTGGTCTTCTTCCACCTCAAGTTTTATCTGTAGCACCACGATCGAA 

TVLQNRK-KVEFKIDIVVLAF- 
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31 1 1 

TCCAGAAGGCCTCCAGCATAGTCTATAAGAAAGAGGGGGAACAGGTGGAGTTCTCCTTCC 

721 ♦ 4 * * * 

AGGTCTTCCGGAGGTCGTATCAGATATTCTTTCTCCCCCTTGTCCACCTCAAGAGGAAGG 

QKASSIVYKKEGEQVEFSFP- 

A AM 
L L N 

U U L 

1 1 1 

CACTCGCCTTTACAGTTGAAAAGCTGACGGGCAGTGGCGAGCTGTGGTGGCAGGCGGAGA 

7 gi ♦ * — * *-- ♦ — ♦ 840 

GTGAGCGGAAATGTCAACTTTTCGACTCCCCG T CACCGCTCGACACCACCGTCCGCCTCT 

LAFTVEKLTCSCEL**' QAER- 
P S 

H . M FM A M 
P N LN U B 
H L ML 3 0 
1 1 11 A 2 
GGGCTTCCTCCTCCAAGTCTTGGATCACCTTTGACCTGAAGAACAAGGAAGTGTCTGTAA 

B41 ♦ * ♦ — — ♦ * 900 

CCCGAAGGACGAGGTTCAGAACCTAGTGGAAACTGGACTTCTTGTTCCTTCACAGACATT 

ASSSKSWIT FDLKNKEVSVK- 

B BS PS 

SM SCADNPAD A AH 
TA TRVRLUUD L LP 
EE NFAAAW9E U U H 

23 11224161 1 11 

AACGGGTTACCCAGGACCCTAAGCTCCAGATGGGCAAGAAGCTCCCGCTCCACCTCACCC 
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TTGCCCAATGGTJTCCTGGGATTCGAGGTCTACCCGTTCTTCGAGGGCGAGGTGGAGTGGG 

RVTQDPK LQMGKKLPLHLTL- 

BS BSS 
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N TR AT D N P TRUAN 

L NF EU E L H NF9EL 

1 11 31 1 11 ll 631 

/ / If 
TGCCCCAGGCCTTGCCTCAGTATGCTGGCTCTGGAAACCTCACCCTGGCCCTTGAAGCGA 
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ACGGGGTCCGGAACGGAGTCATACGACCGACACCTTTGGAGTGGGACCGGGAACTTCGCT 

PQALPQYAGSGNLTLALEAK- 
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AAACAGGAAACTTGCATCACGAAGTGAACCTGGTGGTGATGAGAGCCACTCACCTCCAGA 
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TTTGTCCTTTCAACGTAGTCCTTCACTTGGACCACCACTACTCTCGGTGAGTCGAGGTCT 

TCKLHQEVNLVVMRATQLQK- 

PS S 

M ADNNPA DF AK' DE A 

U VRlLUU da ln ds l 

L AAAAVS EN UL EP U 

1 224416 22 12 11 1 

///// / / / 

. AAAATTTGACCTGTGAGGTGTGGGGACCCACCTCCCCTAAGCTGATGCTGAGCTTGAAAC 

1081 ♦-- -■» ♦ « * ♦ 1140 

TTTTAAACTGGACACTCCACACCCCTGGGTGGAGGGGATTCGACTACGACTCGAACTTTG 

NLTCEVWGPTSPKLMLSLKL- 
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L Q A L ET 

1 1 2 1 12 

/ 

TGGAGAACAAGGAGGCAAAGGTCTCGAAGCGGGAGAAGCCGGTGTGGGTGCTGAACCCTG 
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ACCTCTTGTTCCTCCGTTTCCAGAGCTTCGCCCTCTTCGGCCACACCCACGACTTGGGAC 

ENKEAKVSKREKP VWVLNPE- 
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F D M I A ADPA I 

0 D A N V VRUU N 

K E E F A AAV 9 F 

113 11 2216 1 

/// 

AGGCGGGGATGTGGCAGTGTCTGCTGAGTGACTCGGGACAGGTCCTGCTGGAATCCAACA 

1201 ♦ * ♦ ♦ ♦ ♦ 1260 

TCCGCCCCTACACCGTCACAGACGACTCACTGAGCCCTGTCCAGGACGACCTTAGGTTGT 

AGMWQC LLSDSGQVLLE SNI- 
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CTTCAGCGCTCCTGCCTGGACGCATCCCGGCTA7GCAGCCCCAGTCCAGGGCAGCAAGGC 

1321 ♦ * ♦ 1380 

CAACTCGCGAGGACGGACCTGCG7AGGGCCCA7ACGTCGGGCTCACGTCCCGKGTTCCC 

5 S 

DBHVHNA HMN-CK V VNDV 

RBABPLU PNCR_ K N'.DS 

AVE0HA9 ALIFA L LA£0 
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AGGCCCCGTCTGCCTC7TCACCCGGAGCCTCTGCCCGCCCCAC7CATGCTCAGGGAGAGG 
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7CCGGGGCAGACGGAGAAG7GGGCC7CGGACACGGGCGGGG7GAG7ACGAG7CCC7C7CC 
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G7C77C7GGC7TTnCCCAGGC7C7GGGCAGGCACAGGC7AGG7GCCCC7AACCCAGGCC 

1441 ♦ * ♦ ♦ 4 —4 1500 

CAGAAGACCGAAAAAGGG7CCGAGACCCG7CCG7G7CCGA7CCACGGGGA77GGG7CCGG 
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GACC7G7CTTrCCCCG7CCACCACCCCAC7C7CCACCC77C7CCG7A7AGCCCC7CC7GC 
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GACGGGGACTGGATTCGGGTGGGGTTTCCGGTTTGAGAGGTGAGGGAGTCGAGCCTGTGG 
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AAGAGAGGAGGGTCTAAGGTCATTGAGGCTTAGAAGACAGAGTCCC7CACGTAGGCGGGG 
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CGGACAAACTCTTCGTCGCCCGTTCTT7CTGCGTTCGGGTCTCCGGGACGGTAAAGACAC 
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DAP VRLUU D RALU N AN PCR 
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GAAGCCTTTGGGACCAAGGAAACTTCCCCCTGATCATCAAGAATCTTAAGATAGAAGACT 
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CTTCGGAAACCCTGGTTCCTTTGAAGGGGGACTAGTAGTTCTTAGAATTCTATCTTCTGA 
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CAGATACTTACATCTGTGAAGTGGAGGACCAGAAGGAGGAGGTGCAATTGCTAGTGTTCG 
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GTCTATGAATGTAGACACTTCACCTCCTGGTCTTCCTCCTCCACGTTAACGATCACAAGC 

DTYICEVEDQKEEVQLL VFG- 
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GATTGACTGCCAACTCTGACACCCACCTGCTTCAGGGGCAGAGCCTGACCCTGACCTTGG 

491 4 4. 4 ^ ^ 

CTAACTGACCCTTGAGACTGTGGGTGGACGAACTCCCCGTCTCGGACTGGGACTGGAACC 
LTANSDTHLLQGQ. SLTLTLE- 

B BS H 
BS SC D V IS 

AP TR D N NT 

N] NF E L FY 

22 11 1 111 

/ / 

AGAGCCCCCCTGGTAGTAGCCCCTCAGTGCAATGTAGGAGTCCAACGGGTAAAAACATAC 
$41 ♦ „ «. 4 4 + 6Q0 

TCTCGGGGGGACCATCATCGGGGAGTCACGTTACATCCTCAGGTTCCCCATTTTTGTATG 
SPPGSSPSVQCRSPRGKNIQ- 
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B ND LPV L APTIAR T A L TR 

0 LE UBU U N1NACF X N A NF 
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AGGGGGGGAAGACCCTCTCCGTGTCTCAGCTGGAGCTCCAGCATAGTCGCACCTGGACAT 
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TCCCCCCCTTCTGGGAGAGGCACAGAGTCGACCTCGAGGTCCTATCACCGTGGACCTGTA 
30 CGKTLSVSQLELQDSCTWTC- 
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CCACTGTCTTGCAGAACCAGAAGAAGGTGGAGTTCAAAATACACATCGTGGTGCTAGCTT 
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CGTGACAGAACGTGTTGGTCTTCTTCCACCTCAAGTTTTATCTCTAGCACCACGATCGAA 
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TCCAGAAGGCCTCCAGCATAGTCTATAAGAAAGAGGGGGAACAGGTCGACTTCTCCTTCC 
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AGGTCTTCCGGAGGTCGTATCAGATATTCITTCTCCCCCTTGTCCACCTCAAGAGGAACG 
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CCGTGCTGGACTCCGACGGCTCCTTCTTCCTCTACACCAAGCTCACCGTGGACAAGAGCA 
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GGCACGACCTGAGGCTGCCGAGGAACAAGGAGATGTCGTTCGAGTGGCACCTGTTCTCGT 
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GGTGGCAGCAGGGGAACGTCTTCTCATGCTCCGTGATGCATGAGGCTCTGCACAACCACT 

2041 — — ♦ — 2100 

CCACCGTCGTCCCCTTGCAGAAGAGTACGAGGCACTACGTACTCCGAGACCTGTTGGTGA 

WQQGNVFSC SVMHEALHNHY- 

S 

M M HNC CXH 
B N PCR FMA 
0 L AIF RAE 
2 1 211 133 

/ / 

ACACGCAGAAGAGCCTCTCCCTG7CTCCGGGTAAATGAGTGCGACGGCCG 

2ioi ♦ ♦ ♦ — ♦ ♦ 2150 

TGTGCGTCTTCTCGGAGAGGGACAGAGGCCCATTTACTCACGCTGCCCGC 

TQKSLS LSPGK. 
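
The listings above pair each top-strand line with its complementary strand and with the one-letter translation of the open reading frame. The relationship between the three rows can be sketched in Python as follows. This is a minimal illustration using the standard genetic code; the function names are hypothetical, and the test fragment is the start of the CD4 leader as it reads in the listing (ATG AAC CGG GGA ..., translated M N R G V P F R ...).

```python
# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def complement(dna: str) -> str:
    """Base-by-base complement, written in the same left-to-right order
    as the lower strand of the listing (not reversed)."""
    return dna.upper().translate(str.maketrans("ACGT", "TGCA"))


def translate(dna: str) -> str:
    """Translate complete codons of the top strand, starting at position 0."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


if __name__ == "__main__":
    leader = "ATGAACCGGGGAGTCCCTTTTAGGCACTTGCTTCTG"
    print(translate(leader))       # one-letter amino acid sequence
    print(complement("GGCTCAGG"))  # lower-strand bases for the same positions
```

Such a check is useful mainly for spotting positions where a scanned listing disagrees with its own translation row.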



Example 2: Preparation of the Fusion Proteins from Supernatants of COS Cells

COS cells grown in DME medium supplemented with 10% calf serum and gentamicin sulfate at 15
µg/ml were split into DME medium containing 10% NuSerum (Collaborative Research) and gentamicin to
give 50% confluence the day before transfection. The next day, CsCl-purified plasmid DNA was added to a
final concentration of 0.1 to 2.0 µg/ml, followed by DEAE dextran to 400 µg/ml and chloroquine to 100 µM.
After 4 hours at 37 °C, the medium was aspirated and a 10% solution of dimethyl sulfoxide in phosphate
buffered saline was added for 2 minutes, aspirated, and replaced with DME/10% calf serum. 8 to 24 hours
later, the cells were trypsinized and split 1:2.
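
The final concentrations named in this transfection step follow from the usual dilution relation C1 x V1 = C2 x V2. A minimal Python sketch is given below. The stock concentrations and the 10 ml culture volume are assumptions chosen for illustration (the text gives neither), and the small volume of added stock is neglected.

```python
def stock_volume_ul(final_conc: float, stock_conc: float, culture_ml: float) -> float:
    """Volume of stock (in µl) needed to reach final_conc in culture_ml of medium.

    final_conc and stock_conc must be in the same unit (e.g. both µg/ml or both µM).
    The added volume is assumed to be negligible relative to the culture volume.
    """
    if stock_conc <= 0 or final_conc < 0:
        raise ValueError("concentrations must be positive")
    return final_conc / stock_conc * culture_ml * 1000.0


if __name__ == "__main__":
    culture_ml = 10.0  # hypothetical dish volume, not from the text
    # (target from the text, assumed stock concentration in the same unit)
    reagents = {
        "DEAE dextran (µg/ml)": (400.0, 10_000.0),  # assumed 10 mg/ml stock
        "chloroquine (µM)": (100.0, 100_000.0),     # assumed 100 mM stock
    }
    for name, (final, stock) in reagents.items():
        print(f"{name}: add {stock_volume_ul(final, stock, culture_ml):.1f} µl")
```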

For radiolabeling, the medium was aspirated 40 to 48 hours after transfection, the cells were washed once
with phosphate buffered saline, and DME medium lacking cysteine or methionine was added. 30 minutes
later, ³⁵S-labeled cysteine and methionine were added to final concentrations of 30-60 µCi and 100-200 µCi,
respectively, and the cells were allowed to incorporate label for 8 to 24 more hours. The supernatants were
recovered and examined by electrophoresis on 7.5% polyacrylamide gels following denaturation and
reduction, or on 5% polyacrylamide gels following denaturation without reduction. The CD4Bγ1 protein gave
the same molecular mass with or without reduction, while the CD4Eγ1 and CD4Hγ1 fusion proteins showed
molecular masses without reduction of twice the mass observed with reduction, indicating that they formed
dimers. The CD4 IgM fusion proteins formed large multimers beyond the resolution of the gel
system without reduction, and monomers of the expected molecular mass with reduction.

Unlabeled proteins were prepared by allowing the cells to grow for 5 to 10 days post transfection in
DME medium containing 5% NuSerum and gentamicin as above. The supernatants were harvested,
centrifuged, and purified by batch adsorption to either protein A trisacryl, protein A agarose, goat
anti-human IgG antibody agarose, rabbit anti-human IgM antibody agarose, or monoclonal anti-CD4 antibody
agarose. Antibody agarose conjugates were prepared by coupling purified antibodies to cyanogen bromide
activated agarose according to the manufacturer's recommendations, using an antibody concentration
of 1 mg/ml. Following batch adsorption by shaking overnight on a rotary table, the beads were harvested by
pouring into a sintered glass funnel and washed a few times on the funnel with phosphate buffered saline
containing 1% Nonidet P40 detergent. The beads were removed from the funnel and poured into a small
disposable plastic column (Quik-Sep QS-Q column, Isolab), washed with at least 20 column volumes of
phosphate buffered saline containing 1% Nonidet P40 and then with 5 volumes of 0.15 M NaCl, 1 mM EDTA
(pH 8.0), and eluted by the addition of either 0.1 M acetic acid, 0.1 M acetic acid containing 0.1 M NaCl,
or 0.25 M glycine-HCl buffer, pH 2.5.

Example 3: Blockage of Syncytium Formation by the Fusion Proteins

Purified or partially purified fusion proteins were added to HPB-ALL cells infected 12 hours previously
with a vaccinia virus recombinant encoding HIV envelope protein. After incubation for 6-8 more hours, the
cells were washed with phosphate buffered saline, fixed with formaldehyde, and photographed. All of the
full-length CD4 immunoglobulin fusion proteins showed inhibition of syncytium formation at a concentration
of 20 µg/ml, with the exception of the CD4Hγ1 protein, which was tested only at 5 µg/ml and showed partial
inhibition of syncytium formation under the same conditions.

Example 4: Chromium Release Cytolysis Assay 

The purified fusion proteins were examined for their ability to fix complement in a chromium release assay
using vaccinia virus infected cells as a model system. Namalwa (B cell) or HPB-ALL (T cell) lines were
infected with vaccinia virus encoding HIV envelope protein, and 18 hours later were radiolabeled by
incubation in 1 mCi/ml sodium ⁵¹chromate in phosphate buffered saline for 1 hour at 37 °C. The labeled cells
were centrifuged to remove the unincorporated chromate, and incubated in microtiter wells with serial
dilutions of the CD4 immunoglobulin fusion proteins and rabbit complement at a final concentration of 40%.
After 1 hour at 37 °C, the cells were mixed well, centrifuged, and the supernatants counted in a gamma-ray
counter. No specific release could be convincingly documented.
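
Chromium release counts of this kind are conventionally expressed as percent specific release, normalizing the experimental counts between a spontaneous-release control and a maximum-release (fully lysed) control. The text does not state which calculation was used, so the Python sketch below shows only the conventional formula; the count values are hypothetical.

```python
def percent_specific_release(experimental_cpm: float,
                             spontaneous_cpm: float,
                             maximum_cpm: float) -> float:
    """Conventional 51Cr release calculation.

    spontaneous_cpm: supernatant counts from labeled cells without effector
    maximum_cpm:     supernatant counts from fully lysed labeled cells
    """
    span = maximum_cpm - spontaneous_cpm
    if span <= 0:
        raise ValueError("maximum release must exceed spontaneous release")
    return 100.0 * (experimental_cpm - spontaneous_cpm) / span


if __name__ == "__main__":
    # Hypothetical counts per minute, for illustration only.
    print(percent_specific_release(1500.0, 1000.0, 6000.0))
```

A value near zero across the dilution series corresponds to the outcome reported here, where no specific release could be documented.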

Example 5: Binding of the CD4Eγ1 Protein to Fc Receptors

Purified CD4Eγ1 fusion protein was tested for its ability to displace radiolabeled human IgG1 from
human Fc receptors expressed on COS cells in culture. The IgG1 was radiolabeled with sodium ¹²⁵iodide
using 1 mCi of iodide, 100 µg of IgG1, and two Iodobeads (Pierce). The labeled protein was separated from
unincorporated counts by passage over a Sephadex G25 column equilibrated with phosphate buffered
saline containing 0.5 mM EDTA and 5% nonfat milk. Serial dilutions of the CD4Eγ1 fusion protein or
unlabeled IgG1 were prepared and mixed with a constant amount of radiolabeled IgG1 tracer. After
incubation with COS cells bearing the FcRI and FcRII receptors at 4 °C for at least 45 minutes in a volume
of 20 µl, 200 µl of a 3:2 mixture of dibutyl and dioctyl phthalates was added, and the cells were separated
from the unbound label by centrifugation in a microcentrifuge for 15 to 30 seconds. The tubes were cut with
scissors, and the cell pellets counted in a gamma-ray counter. The affinity of the CD4Eγ1 protein for
receptors was measured in parallel with the affinity of the authentic IgG1 protein, and was found to be the
same, within experimental error.
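
A displacement experiment of this kind is commonly summarized by the competitor concentration that halves tracer binding (IC50); two competitors of equal affinity give the same value. The text does not describe the analysis used, so the following Python sketch is only an illustration under stated assumptions: a simple one-site competition model, a hypothetical IC50, and a hypothetical two-fold dilution series.

```python
import math


def fraction_tracer_bound(competitor: float, ic50: float) -> float:
    """One-site competition model: tracer bound relative to no competitor."""
    return 1.0 / (1.0 + competitor / ic50)


def estimate_ic50(concs, fractions):
    """Log-linear interpolation of the concentration giving 50% bound.

    concs must be ascending and fractions must cross 0.5 within the series.
    """
    for i in range(1, len(concs)):
        hi_f, lo_f = fractions[i - 1], fractions[i]
        if hi_f > 0.5 >= lo_f:
            t = (hi_f - 0.5) / (hi_f - lo_f)
            log_c = math.log10(concs[i - 1]) + t * (
                math.log10(concs[i]) - math.log10(concs[i - 1]))
            return 10.0 ** log_c
    raise ValueError("series does not bracket 50% displacement")


if __name__ == "__main__":
    dilutions = [1.25, 2.5, 5.0, 10.0, 20.0, 40.0]  # hypothetical, arbitrary units
    assumed_ic50 = 10.0                             # hypothetical
    curve = [fraction_tracer_bound(c, assumed_ic50) for c in dilutions]
    print(round(estimate_ic50(dilutions, curve), 3))
```

Running the same estimate on the fusion protein series and on the unlabeled IgG1 series, and comparing the two values, mirrors the parallel measurement described above.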

5 

Example 6: Stable Expression of the Fusion Construct pCD4Eγ1 in Baby Hamster Kidney Cells

Twenty-four hours before transfection, 0.5 x 10⁵ baby hamster kidney cells (BHK; ATCC CCL10) were
seeded in a 25 cm² culture flask in Dulbecco's modified Eagle's medium (DMEM) containing 10% fetal
calf serum (FCS). The cells were cotransfected with a mixture of the plasmids pCD4Eγ1 (20 µg), pSV2dhfr
(5 µg; Lee et al., Nature 294:228-232 (1981)) and pRMH140 (5 µg; Hudziak et al., Cell 31:137-146 (1982))
according to a modified calcium phosphate transfection technique as described in Zettlmeissl et al. (Behring
Inst. Res. Comm. 82:26-34 (1988)). 72 h post-transfection, cells were split 1:3 to 1:4 (60 mm culture dishes)
and resistant colonies were selected in DMEM medium containing 10% FCS, 400 µg/ml G418 (Geneticin,
Gibco) and 1 µM methotrexate (selection medium). The medium was changed twice a week. The resistant
colonies (40-100/transfection) appeared 10-15 days post-transfection and were further propagated either as a
mixture of clones (i.e., BHK-MK1) or as individually isolated clones. For the determination of the relative
expression levels, clone mixtures or individual clones were grown to confluency in T25 culture flasks,
washed twice with protein-free DMEM medium, and incubated for 24 h with 5 ml protein-free DMEM
medium. These media were collected and subjected to a human IgG specific ELISA in order to determine
the relative expression levels of the CD4-IgG1 fusion protein CD4Eγ1. For further analysis an individual
clone (BHK-UC3) was chosen due to its high relative expression levels.

Example 7: Detection of the CD4Eγ1 Protein in Culture Supernatants

For ³⁵S methionine labeling of cells, the clone BHK-UC3 and untransfected BHK cells (control) were
grown to confluency in T25 culture flasks and subsequently incubated for two hours in HamF12 medium
without methionine. Labeling was achieved by incubating for 24 h in 2.5 ml of the same medium containing
100 µCi ³⁵S methionine (1070 Ci/mmole, Amersham). For the preparation of cell lysates, the labeled cells
were harvested in 1 ml of phosphate buffered saline, pH 7.2 (PBS) and lysed by repetitive freezing and
thawing. Cleared lysates (after centrifugation at 20,000 rpm for 20 min) and culture supernatants were
incubated with Protein A-Sepharose (Pharmacia) and the bound material was analyzed on a 10% SDS-gel
according to Laemmli (Nature 227:680-685 (1970)), which was subsequently autoradiographed. A specific
band of about 80 kDa can be detected only in the supernatant of clone BHK-UC3; it is absent from the lysate
of clone BHK-UC3 and from the respective controls.

Example 8: Purification of the Protein CD4Eγ1 from Culture Supernatants

In order to demonstrate that the fusion protein encoded by the plasmid pCD4Eγ1 can be obtained in high
quantities, the clone BHK-UC3 was grown in 1750 cm² roller bottles in selection medium (500 ml). Confluent
monolayers were washed twice with protein-free DMEM medium (200 ml) and further incubated for 48 h
with protein-free DMEM medium (500 ml). The conditioned culture supernatants (1-2 l) and respective
supernatants from untransfected BHK cells were cleared by centrifugation (9000 rpm, 30 min) and
microfiltered through a 0.45 µm membrane (Nalgene). After addition of 1% (v/v) of 1.9 M Tris-HCl buffer,
pH 8.6, the conditioned medium was adsorbed to a Protein A-Sepharose column equilibrated with 50 mM
Tris-HCl pH 8.6 buffer containing 150 mM NaCl (4 °C). The loaded column was washed with 10 column
volumes of equilibration buffer. Elution of the CD4-IgG1 fusion protein CD4Eγ1 was achieved with 0.1 M
sodium citrate buffer, pH 3, followed by immediate neutralization of the column effluent to pH 8 with Tris
base. The peak fractions were pooled, and the pool was analyzed on a Coomassie blue stained SDS-gel,
resulting in a band of the expected size (80 kDa) which reacted with a polyclonal anti-human IgG heavy
chain antibody and a mouse monoclonal anti-CD4 antibody (BMA040, Behringwerke) in Western blots. The
yield of purified fusion protein obtained by the given procedure is 5-18 mg/24 h/l of culture supernatant.
The respective value for a BHK clone mixture (about 80 resistant clones; BHK-MK1) as described above was
2-3 mg/24 h/l.
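
The yield is quoted per 24 h and per litre, so the amount expected from one roller bottle (500 ml of protein-free medium conditioned for 48 h) can be worked out directly. The Python sketch below does this arithmetic; it assumes that production is linear over the 48 h incubation, which the text does not state.

```python
def expected_yield_mg(rate_mg_per_24h_per_l: float, hours: float, volume_l: float) -> float:
    """Expected purified protein (mg), assuming a constant production rate."""
    return rate_mg_per_24h_per_l * (hours / 24.0) * volume_l


if __name__ == "__main__":
    hours, volume_l = 48.0, 0.5  # one roller bottle, as described in Example 8
    for label, low, high in (("clone BHK-UC3", 5.0, 18.0),
                             ("clone mixture BHK-MK1", 2.0, 3.0)):
        lo_mg = expected_yield_mg(low, hours, volume_l)
        hi_mg = expected_yield_mg(high, hours, volume_l)
        print(f"{label}: {lo_mg:.1f}-{hi_mg:.1f} mg per bottle")
```

Because 48 h of conditioning in 0.5 l happens to equal one litre-day, the per-bottle figure is numerically the same as the quoted rate.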






Example 9: Physical and Biological Characterization of the CD4Eγ1 Fusion Protein

As shown by SDS-electrophoresis on 10-15% gradient gels (Phast-System, Pharmacia) under non-reducing
conditions, the CD4Eγ1 fusion protein migrates at the position of a homodimer (about 160 kDa),
like a non-reduced mouse monoclonal antibody. This result is supported by analytical equilibrium
ultracentrifugation, where the fusion protein behaves as a homogeneous dimeric molecule of about 150 kDa.
The absorbance coefficient of the protein was determined as A280 = 18 cm²/mg using the quantitative protein
determination according to Bradford (Anal. Biochem. 72:248-254 (1976)).
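
The dimer assignment rests on comparing the apparent mass of the non-reduced protein with that of the reduced chain (about 80 kDa in Example 8). A minimal Python sketch of that comparison follows; the function name is illustrative, and rounding to the nearest integer is an assumption that tolerates the usual imprecision of gel and ultracentrifugation estimates.

```python
def subunit_count(native_kda: float, reduced_kda: float) -> int:
    """Nearest whole number of reduced chains per non-reduced molecule."""
    if native_kda <= 0 or reduced_kda <= 0:
        raise ValueError("masses must be positive")
    return round(native_kda / reduced_kda)


if __name__ == "__main__":
    reduced = 80.0  # kDa, reduced band reported in Example 8
    for method, native in (("non-reducing SDS gel", 160.0),
                           ("equilibrium ultracentrifugation", 150.0)):
        print(f"{method}: {subunit_count(native, reduced)} chains")
```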

The CD4Eγ1 fusion protein shows specific complex formation with a solubilized βgal-gp120 fusion
protein (pMB1790; Broker et al., Behring Inst. Res. Commun. 82:338-348 (1988)) expressed in E. coli. In this
protein (110 kDa), a major part of the HIV gp120 protein (VaU 9 -Trp 5 4s) is fused to β-galactosidase (amino
acids 1-375). In a control experiment a 67-kDa βgal-HIV 3'orf fusion protein (βgal 1-375; 3'orf Pro14-Asp123)
showed no complex formation. In these experiments, the CD4Eγ1 protein was incubated with the respective
fusion protein in molar ratios of about 5:1. The complex was isolated by binding to Protein A-Sepharose,
and the Protein A-Sepharose bound proteins, together with relevant controls, were analyzed on 10-15%
gradient SDS-gels (Phast-System, Pharmacia).

The CD4Eγ1 fusion protein binds to the surface of HIV (HIV1/HTLV-IIIB) infected cultured
T4-lymphocytes as determined by direct immunofluorescence with fluorescein-isothiocyanate (FITC) labeled
CD4Eγ1 protein. It blocks syncytia formation in cultured T4-lymphocytes upon HIV infection (0.25 TCID/cell)
at a concentration of 10 µg/ml. Furthermore, HIV-infected cultured T4-lymphocytes (subclone of cell line
H9) are selectively killed upon incubation with CD4Eγ1 in the presence or absence of complement. To a
highly (>50%) HIV infected culture of T4-lymphocytes (10⁵ cells/ml), 50, 10 or 1 µg/ml CD4Eγ1 fusion
protein was added in the presence or absence of guinea pig complement. Cells were observed for specific
killing by the fusion protein, which is defined as the percentage of killed cells after 3 days in relation to
viable cells in the culture at the beginning of the experiment, corrected by the values for unspecific killing
observed in control cultures lacking the CD4Eγ1 fusion protein (Table 5, Experiment I). Surprisingly,
addition of CD4Eγ1 protein to the infected T4 cells in the absence of complement resulted in similar
specific killing rates as in the presence of complement (Table 5, Experiment II). This result demonstrates a
complement independent cytolytic effect of CD4Eγ1 on HIV infected T-lymphocytes in culture.

30 

Table 5

Experiment No.   Assay System                                         Specific Killing (%)

I                non-infected T4-cells + 50 µg/ml CD4Eγ1 + Compl.      0.7
                 infected T4-cells + 50 µg/ml CD4Eγ1 + Compl.         35.1
                 infected T4-cells + 10 µg/ml CD4Eγ1 + Compl.         25.1
                 infected T4-cells + 1 µg/ml CD4Eγ1 + Compl.          25

II               infected T4-cells + 10 µg/ml CD4Eγ1 + Compl.         49.9
                 infected T4-cells + 10 µg/ml CD4Eγ1 + Compl.         69.4
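
Specific killing is defined in the text as the percentage of cells killed after 3 days relative to the viable cells at the start, corrected for the unspecific killing seen in control cultures without fusion protein. The Python sketch below is one reading of that definition, assuming the correction is a simple subtraction of the control percentage; the cell counts are hypothetical and are not taken from Table 5.

```python
def percent_killed(viable_start: float, viable_end: float) -> float:
    """Cells lost over the incubation, as a percentage of the starting viable cells."""
    if viable_start <= 0:
        raise ValueError("starting viable cell count must be positive")
    return 100.0 * (viable_start - viable_end) / viable_start


def specific_killing(treated_start: float, treated_end: float,
                     control_start: float, control_end: float) -> float:
    """Killing in the treated culture minus unspecific killing in the control
    culture (subtraction is an assumed form of the 'correction')."""
    return (percent_killed(treated_start, treated_end)
            - percent_killed(control_start, control_end))


if __name__ == "__main__":
    start = 1.0e5  # cells/ml, the starting density given in Example 9
    # Hypothetical viable counts after 3 days:
    print(specific_killing(start, 5.0e4, start, 8.5e4))
```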



Having now fully described this invention, it will be appreciated by those skilled in the art that the same
can be performed within a wide range of equivalent parameters of composition, conditions, and methods of
preparing such fusion proteins without departing from the spirit or scope of the invention or any
embodiment thereof.

Claims 

Claims for the following Contracting States: AT, BE, CH, DE, FR, GB, IT, LI, LU, NL, SE

1. A gene encoding a fusion protein which is capable of being secreted comprising 1) the DNA sequence
of extracellular CD4, or fragment thereof which binds to HIV gp120 when fused to an immunoglobulin
chain, and 2) the DNA sequence of an immunoglobulin heavy chain, characterized in that the DNA
sequence which encodes at least the variable region of said immunoglobulin has been replaced with
the DNA sequence which encodes extracellular CD4, or said gp120 binding fragment thereof.

2. The fusion protein gene of claim 1, characterized in that said immunoglobulin chain is of the class IgM,
IgG1 or IgG3.

3. A vector comprising the fusion protein gene of claim 1, or having the identifying characteristics of
pCD4Hγ1, which has been deposited under Accession No. 67611, or pCD4Mµ, which has been
deposited under Accession No. 67608, or of pCD4Pµ, which has been deposited under Accession No.
67609, or of pCD4Eγ1, which has been deposited under Accession No. 67610, all in E. coli at the
ATCC under the terms of the Budapest Treaty.

4. A host transformed with the vector of claim 3.

5. The host of claim 4 which expresses an immunoglobulin light chain together with the expression
product of said fusion protein gene to give an immunoglobulin-like molecule which binds to gp120.

6. The host of claim 5, wherein said immunoglobulin heavy chain is of the immunoglobulin class IgM,
IgG1 or IgG3.

7. A method of producing a fusion protein encoded by a gene of claim 1, characterized by cultivating in a 
nutrient medium under protein-producing conditions, a host strain transformed with the vector comprising 
the gene of claim 1, said vector further comprising expression signals which are recognized by said 
host strain and direct expression of said fusion protein, and recovering the fusion protein so produced. 

8. The method of claim 7, wherein said host strain is a myeloma cell line which produces immunoglobulin 
light chains and said fusion protein comprises an immunoglobulin heavy chain of the class IgM, IgG1 or 
IgG3, wherein an immunoglobulin-like molecule comprising said fusion protein is produced. 

9. A fusion protein encoded by a gene according to claim 1. 

10. A fusion protein of claim 9, which further comprises a therapeutic agent, a detectable label or an NMR 
imaging agent linked to said fusion protein. 

11. The fusion proteins CD4Hγ1, CD4Mμ, CD4Pμ, CD4Eγ1 producible by the vectors ATCC 67611, 
67608, 67609 and 67610, respectively, or CD3Bγ1 according to the following amino acid sequence: 
GCCTCTTTGAGAAGCAGCGCGCAAGAAAGACGCAAGCCCAGAGGCCCTGCCATTTCTGTG 
1                                                         60 
CGGACAAACTCTTCGTCGCCCGTTCTTTCTGCGTTCGGGTCTCCGGGACGGTAAAGACAC 

GCCTCAGGTCCCTACTGGCTCAGGCCCCTCCCTCCCTCGGCAAGGCCACAATGAACCGGG 
61                                                       120 
CCGAGTCCAGGGATGACCGAGTCCGGGGACGCAGGGAGCCGTTCCGGTGTTACTTGCCCC 

GAAGCCTTTGGGACCAAGGAAACTTCCCCCTCATCATCAAGAATCTTAACATAGAAGACT 
CTTCCGAAACCCTCGTTCCTTTGAACCGCGACTAGTAGTTCTTAGAATTCTATCTTCTGA 

12. An immunoglobulin-like molecule, comprising the fusion protein of claim 9 and an immunoglobulin light 
chain. 

13. An immunoglobulin-like molecule of claim 12, further comprising a therapeutic agent, detectable label 
or NMR imaging agent linked to said immunoglobulin-like molecule. 

14. A complex comprising the fusion protein of claim 9 or 10 and HIV or SIV gp120. 

15. The complex of claim 14, wherein said gp120 is a part of an HIV or SIV, is expressed on the surface of 
an HIV or SIV-infected cell or is present in solution. 

16. A method of using a fusion protein of claim 9 or 10 for the detection of HIV or SIV gp120 in a sample, 
characterized by 

(a) contacting a sample suspected of containing HIV or SIV gp120 with said fusion protein and 

(b) detecting whether a complex is formed, said fusion protein being detectably labeled. 

Claims for the following Contracting State : ES 

1. A method of producing a fusion protein encoded by a gene encoding a fusion protein which is capable 
of being secreted comprising 1) the DNA sequence of extracellular CD4, or fragment thereof which 
binds to HIV gp120 when fused to an immunoglobulin chain, and 2) the DNA sequence of an 
immunoglobulin heavy chain, characterized in that the DNA sequence which encodes at least the 
variable region of said immunoglobulin has been replaced with the DNA sequence which encodes 
extracellular CD4, or said gp120-binding fragment thereof, characterized by cultivating in a nutrient 
medium under protein-producing conditions, a host strain transformed with the vector comprising the 
gene of claim 1, said vector further comprising expression signals which are recognized by said host 
strain and direct expression of said fusion protein, and recovering the fusion protein so produced. 

2. The method of claim 1, wherein said host strain is a myeloma cell line which produces immunoglobulin 
light chains and said fusion protein comprises an immunoglobulin heavy chain of the class IgM, IgG1 or 
IgG3, wherein an immunoglobulin-like molecule comprising said fusion protein is produced. 

3. The method of claim 1 or 2, wherein the vector has the identifying characteristics of pCD4Hγ1, 
which has been deposited under Accession No. 67611, or of pCD4Mμ, which has been deposited under 
Accession No. 67608, or of pCH4Pμ, which has been deposited under Accession No. 67609, or of 
pCD4Eγ1, which has been deposited under Accession No. 67610, all in E. coli at the ATCC under the 
terms of the Budapest Treaty. 

Claims for the following Contracting State : GR 

1. A vector comprising a gene encoding a fusion protein which is capable of being secreted comprising 1) 
the DNA sequence of extracellular CD4, or fragment thereof which binds to HIV gp120 when fused to 
an immunoglobulin chain, and 2) the DNA sequence of an immunoglobulin heavy chain, characterized 
in that the DNA sequence which encodes at least the variable region of said immunoglobulin has been 
replaced with the DNA sequence which encodes extracellular CD4, or said gp120-binding fragment 
thereof. 

2. A vector according to claim 1, characterized in that said immunoglobulin chain is of the class IgM, IgG1 
or IgG3. 

3. A vector having the identifying characteristics of pCD4Hγ1, which has been deposited under Accession 
No. 67611, or of pCD4Mμ, which has been deposited under Accession No. 67608, or of pCH4Pμ, which 
has been deposited under Accession No. 67609, or of pCD4Eγ1, which has been deposited under 
Accession No. 67610, all in E. coli at the ATCC under the terms of the Budapest Treaty. 

4. A host transformed with the vector of claims 1-3. 

5. The host of claim 4 which expresses an immunoglobulin light chain together with the expression 
product of said fusion protein gene to give an immunoglobulin-like molecule which binds to gp120. 

6. The host of claim 5, wherein said immunoglobulin heavy chain is of the immunoglobulin class IgM, 
IgG1 or IgG3. 

7. A method of producing a fusion protein encoded by a gene of a vector of claim 1, characterized by 
cultivating in a nutrient medium under protein-producing conditions, a host strain transformed with the 
vector of claim 1, said vector further comprising expression signals which are recognized by said host 
strain and direct expression of said fusion protein, and recovering the fusion protein so produced. 

8. The method of claim 7, wherein said host strain is a myeloma cell line which produces immunoglobulin 
light chains and said fusion protein comprises an immunoglobulin heavy chain of the class IgM, IgG1 or 
IgG3, wherein an immunoglobulin-like molecule comprising said fusion protein is produced. 

9. A method for the detection of HIV or SIV gp120 in a sample, characterized by 

a) contacting a sample suspected of containing HIV or SIV gp120 with a fusion protein produced by 
a method according to claim 7 or 8 and 

b) detecting whether a complex is formed, said fusion protein being detectably labeled. 

10. A method according to claim 9, wherein an immunoglobulin-like molecule further comprises a 
therapeutic agent, detectable label or NMR imaging agent linked to said immunoglobulin-like molecule. 

Claims (German-language version) 

Claims for the following Contracting States : AT, BE, CH, DE, FR, GB, IT, LI, LU, NL, SE 

1. A gene which codes for a fusion protein which can be secreted, and which gene contains: 
1) the DNA sequence of extracellular CD4, or of a fragment thereof, which CD4 or 
fragment thereof binds to HIV gp120 when this CD4 or fragment thereof is fused to an 
immunoglobulin chain, and 2) the DNA sequence of a heavy chain of an immunoglobulin, 
characterized in that the DNA sequence which codes for at least the variable region of this 
immunoglobulin has been replaced by the DNA sequence which codes for the extracellular CD4 
or for the gp120-binding fragment thereof. 

2. The fusion protein gene according to claim 1, characterized in that the immunoglobulin chain is one 
of the classes IgM, IgG1 or IgG3. 

3. A vector which contains the fusion protein gene according to claim 1, or which has the identifying 
characteristics of pCD4Hγ1, which has been deposited under Accession No. 67611, 
or which has the identifying characteristics of pCD4Mμ, which has been deposited under 
Accession No. 67608, or which has the identifying characteristics 
of pCH4Pμ, which has been deposited under Accession No. 67609, or 
which has the identifying characteristics of pCD4Eγ1, which has been deposited under Accession 
No. 67610, all of these plasmids having been deposited in E. coli at the ATCC under the 
terms of the Budapest Treaty. 

4. A host cell or host organism which has been transformed with the vector according to claim 3. 

5. The host cell or host organism according to claim 4, which expresses a light chain of an 
immunoglobulin together with the expression product of said fusion protein gene, 
giving an immunoglobulin-like molecule which binds to gp120. 

6. The host cell or host organism according to claim 5, wherein the heavy chain of the immunoglobulin 
belongs to the immunoglobulin class IgM, IgG1 or IgG3. 

7. A method for producing a fusion protein encoded by a gene according to claim 1, characterized 
in that a strain of host cells or host organisms which have been transformed with the vector 
containing the gene according to claim 1 is cultivated in a nutrient medium under conditions 
promoting protein production, this vector further containing expression signals which are 
recognized by said strain of host cells or host organisms and which direct the expression of the 
fusion protein, and in that the fusion protein so produced is recovered. 

8. The method according to claim 7, wherein the strain of host cells is a myeloma cell line which produces 
light chains of an immunoglobulin, and wherein the fusion protein contains a heavy chain of an 
immunoglobulin of the class IgM, IgG1 or IgG3, and wherein an immunoglobulin-like molecule 
comprising said fusion protein is produced. 

9. A fusion protein encoded by a gene according to claim 1. 

10. A fusion protein which further contains a therapeutic agent, a detectable label or an NMR imaging 
agent bound to the fusion protein. 

11. The fusion proteins CD4Hγ1, CD4Mμ, CD4Pμ, CD4Eγ1, which can be produced with the vectors ATCC 67611, 
67608, 67609 and 67610, respectively, or CD3Bγ1 according to the following amino acid sequence: 

12. An immunoglobulin-like molecule which contains the fusion protein according to claim 9 and a light chain 
of an immunoglobulin. 

13. The immunoglobulin-like molecule according to claim 12, which further contains a therapeutic agent, a 
detectable label or an NMR imaging agent bound to the immunoglobulin-like molecule. 

14. A complex which contains the fusion protein according to claim 9 or 10 and HIV or SIV gp120. 

15. The complex according to claim 14, wherein said gp120 forms part of an HIV or SIV, is expressed on the 
surface of a cell infected with HIV or SIV, or is present in solution. 

16. A method of using a fusion protein according to claim 9 or 10 for the detection of HIV or SIV 
gp120 in a sample, characterized by: 

(a) contacting a sample suspected of containing HIV or SIV gp120 with the fusion protein; and 

(b) determining whether a complex is formed, said fusion protein being detectably labeled. 

Claims for the following Contracting State : ES 

1. A method for producing a fusion protein encoded by a gene, which gene codes for a 
fusion protein which can be secreted, and which gene contains: 1) the DNA sequence 
of extracellular CD4, or of a fragment thereof, which CD4 or fragment thereof binds to 
HIV gp120 when this CD4 or fragment thereof is fused to an immunoglobulin chain, 
and 2) the DNA sequence of a heavy chain of an immunoglobulin, characterized in that 
the DNA sequence which codes for at least the variable region of this immunoglobulin has been 
replaced by the DNA sequence which codes for the extracellular CD4 or for the gp120-binding 
fragment thereof, characterized in that a strain of host cells or host organisms which have been 
transformed with the vector containing the gene according to claim 1 is cultivated in a nutrient 
medium under conditions promoting protein production, this vector further containing expression 
signals which are recognized by said strain of host cells or host organisms and which direct the 
expression of the fusion protein, and in that the fusion protein so produced is recovered. 

2. The method according to claim 1, wherein the strain of host cells is a myeloma cell line which produces 
light chains of an immunoglobulin, and wherein the fusion protein contains a heavy chain of an 
immunoglobulin of the class IgM, IgG1 or IgG3, and wherein an immunoglobulin-like molecule 
containing said fusion protein is produced. 

3. The method according to claim 1 or 2, wherein the vector has the identifying characteristics of 
pCD4Hγ1, which has been deposited under Accession No. 67611, or 
the identifying characteristics of pCD4Mμ, which has been deposited under Accession No. 
67608, or the identifying characteristics of pCH4Pμ, 
which has been deposited under Accession No. 67609, or the identifying 
characteristics of pCD4Eγ1, which has been deposited under Accession No. 67610, 
all of these plasmids having been deposited in E. coli at the ATCC under the terms of the 
Budapest Treaty. 

Claims for the following Contracting State : GR 

1. A vector which contains a gene which codes for a fusion protein which can be secreted, 
and which gene contains: 1) the DNA sequence of extracellular CD4, or of a fragment 
thereof, which CD4 or fragment thereof binds to HIV gp120 when this CD4 or 
fragment thereof is fused to an immunoglobulin chain, and 2) the DNA sequence of a heavy 
chain of an immunoglobulin, characterized in that the DNA sequence which codes for at least 
the variable region of this immunoglobulin has been replaced by the DNA sequence which 
codes for the extracellular CD4 or for the gp120-binding fragment thereof. 

2. The vector according to claim 1, characterized in that the immunoglobulin chain is one of the classes 
IgM, IgG1 or IgG3. 



EP 0 325 262 B1 



3. A vector which has the identifying characteristics of pCD4Hγ1, which has been deposited under 
Accession No. 67611, or which has the identifying characteristics of 
pCD4Mμ, which has been deposited under Accession No. 67608, or which has 
the identifying characteristics of pCH4Pμ, which has been deposited under Accession No. 
67609, or which has the identifying characteristics of pCD4Eγ1, 
which has been deposited under Accession No. 67610, all of these plasmids having been 
deposited in E. coli at the ATCC under the terms of the Budapest Treaty. 

4. A host cell or host organism which has been transformed with the vector according to claims 1 to 3. 

5. The host cell or host organism according to claim 4, which expresses a light chain of an 
immunoglobulin together with the expression product of said fusion protein gene, 
giving an immunoglobulin-like molecule which binds to gp120. 

6. The host cell or host organism according to claim 5, wherein the heavy chain of the immunoglobulin 
belongs to the immunoglobulin class IgM, IgG1 or IgG3. 

7. A method for producing a fusion protein encoded by a gene of a vector according to claim 1, 
characterized in that a strain of host cells or host organisms which have been transformed with the 
vector according to claim 1 is cultivated in a nutrient medium under conditions promoting protein 
production, this vector further containing expression signals which are recognized by said strain 
of host cells or host organisms and which direct the expression of the fusion protein, and in that 
the fusion protein so produced is recovered. 

8. The method according to claim 7, wherein the strain of host cells is a myeloma cell line which produces 
light chains of an immunoglobulin, and wherein the fusion protein contains a heavy chain of an 
immunoglobulin of the class IgM, IgG1 or IgG3, and wherein an immunoglobulin-like molecule 
comprising said fusion protein is produced. 

9. A method for detecting HIV or SIV gp120 in a sample, characterized by: 

(a) contacting a sample suspected of containing HIV or SIV gp120 with a fusion protein 
which has been produced by a method according to claim 7 or 8, and 

(b) determining whether a complex is formed, said fusion protein being detectably labeled. 

10. The method according to claim 9, wherein an immunoglobulin-like molecule further contains a therapeutic 
agent, a detectable label or an NMR imaging agent which is bound to said 
immunoglobulin-like molecule. 

Claims (French-language version) 

Claims for the following Contracting States : AT, BE, CH, DE, FR, GB, IT, LI, LU, NL, SE 

1. A gene coding for a fusion protein which is capable of being secreted, comprising 1) the DNA 
sequence of extracellular CD4, or of a fragment of CD4 which binds to HIV gp120 when fused 
to an immunoglobulin chain, and 2) the DNA sequence of an immunoglobulin heavy chain, 
characterized in that the DNA sequence which codes for at least the variable region of said 
immunoglobulin has been replaced by the DNA sequence which codes for the extracellular CD4 protein, 
or said fragment thereof which binds to gp120. 

2. The fusion protein according to claim 1, characterized in that said immunoglobulin chain 
belongs to the class IgM, IgG1 or IgG3. 

3. A vector comprising the fusion protein gene of claim 1, or having the identifying characteristics 
of pCD4Hγ1, which has been deposited under No. 67611, or of pCD4Mμ, which has been deposited 
under No. 67608, or of pCH4Pμ, which has been deposited under No. 67609, or of pCD4Eγ1, which has been 
deposited under No. 67610, all in E. coli at the ATCC, under the terms of the Budapest Treaty. 

4. A host transformed with the vector of claim 3. 

5. The host of claim 4, which expresses an immunoglobulin light chain together with the 
expression product of said fusion protein gene, to give an immunoglobulin-like molecule 
which binds to gp120. 

6. The host of claim 5, wherein said immunoglobulin heavy chain belongs to the 
immunoglobulin class IgM, IgG1 or IgG3. 

7. A method of producing a fusion protein encoded by a gene of claim 1, characterized 
by cultivating, in a nutrient medium, under protein-producing conditions, a host strain 
transformed with the vector comprising the gene of claim 1, said vector further comprising 
expression signals which are recognized by said host strain and direct the expression of said 
fusion protein, and recovering the fusion protein so produced. 

8. The method of claim 7, wherein said host strain is a myeloma cell line which 
produces immunoglobulin light chains, and said fusion protein comprises an immunoglobulin 
heavy chain belonging to the class IgM, IgG1 or IgG3, and wherein an 
immunoglobulin-like molecule comprising said fusion protein is produced. 

9. A fusion protein encoded by a gene according to claim 1. 

10. The fusion protein of claim 9, which further comprises a therapeutic agent, a detectable 
label, or an NMR imaging agent, linked to said fusion protein. 

11. The fusion proteins CD4Hγ1, CD4Mμ, CD4Pμ, CD4Eγ1, producible by the vectors ATCC 
67611, 67608, 67609 and 67610, respectively, or CD3Bγ1 according to the following amino acid 
sequence: 

12. An immunoglobulin-like molecule, comprising the fusion protein of claim 9 and an 
immunoglobulin light chain. 

13. The immunoglobulin-like molecule of claim 12, further comprising a therapeutic agent, 
a detectable label, or an NMR imaging agent, linked to said immunoglobulin-like 
molecule. 

14. A complex comprising the fusion protein of claim 9 or 10 and HIV or SIV gp120. 

15. The complex of claim 14, wherein said gp120 protein is part of an HIV or SIV, is 
expressed on the surface of a cell infected with HIV or SIV, or is present in solution. 

16. A method of using a fusion protein of claim 9 or 10 for the detection of HIV or SIV gp120 
in a sample, characterized by 

(a) contacting a sample suspected of containing HIV or SIV gp120 with said 
fusion protein, and 

(b) detecting whether or not a complex is formed, said fusion protein being detectably 
labeled. 

Claims for the following Contracting State : ES 

1. A method of producing a fusion protein encoded by a gene coding for a fusion protein 
which is capable of being secreted, comprising 1) the DNA sequence of extracellular CD4, or of a 
fragment of CD4 which binds to HIV gp120 when fused to an immunoglobulin chain, and 
2) the DNA sequence of an immunoglobulin heavy chain, characterized in that the DNA 
sequence which codes for at least the variable region of said immunoglobulin has been replaced by the 
DNA sequence which codes for the extracellular CD4 protein, or said fragment of CD4 which binds to 
gp120, characterized by cultivating in a nutrient medium, under protein-producing conditions, 
a host strain transformed with the vector comprising the gene of claim 1, said vector 
further comprising expression signals which are recognized by said host strain and direct 
the expression of said fusion protein, and recovering the fusion protein so produced. 

2. The method of claim 1, wherein said host strain is a myeloma cell line which 
produces immunoglobulin light chains, and said fusion protein comprises an immunoglobulin 
heavy chain belonging to the class IgM, IgG1 or IgG3, and wherein an 
immunoglobulin-like molecule comprising said fusion protein is produced. 

3. The method of claim 1 or 2, wherein the vector has the identifying characteristics of 
pCD4Hγ1, which has been deposited under No. 67611, or of pCD4Mμ, which has been deposited under No. 67608, or 
of pCH4Pμ, which has been deposited under No. 67609, or of pCD4Eγ1, which has been deposited under No. 67610, 
all in E. coli, at the ATCC, under the terms of the Budapest Treaty. 

Claims for the following Contracting State : GR 

1. A vector comprising a gene coding for a fusion protein which is capable of being secreted, 
comprising 1) the DNA sequence of extracellular CD4, or of a fragment of CD4 which binds to HIV 
gp120 when fused to an immunoglobulin chain, and 2) the DNA sequence of an 
immunoglobulin heavy chain, characterized in that the DNA sequence which codes for at least the 
variable region of said immunoglobulin has been replaced by the DNA sequence which codes for the 
extracellular CD4 protein, or said fragment thereof which binds to gp120. 

2. The vector according to claim 1, characterized in that said immunoglobulin chain belongs to the 
class IgM, IgG1 or IgG3. 

3. A vector having the identifying characteristics of pCD4Hγ1, which has been deposited under No. 67611, or 
of pCD4Mμ, which has been deposited under No. 67608, or of pCH4Pμ, which has been deposited under No. 67609, or 
of pCD4Eγ1, which has been deposited under No. 67610, all in E. coli at the ATCC, under the terms of the 
Budapest Treaty. 

4. A host transformed with the vector of claims 1 to 3. 

5. The host of claim 4, which expresses an immunoglobulin light chain together with the 
expression product of said fusion protein gene, to give an immunoglobulin-like molecule 
which binds to gp120. 

6. The host of claim 5, wherein said immunoglobulin heavy chain belongs to the 
immunoglobulin class IgM, IgG1 or IgG3. 

7. A method of producing a fusion protein encoded by a gene of a vector of claim 1, 
characterized by cultivating, in a nutrient medium, under protein-producing conditions, a 
host strain transformed with the vector of claim 1, said vector further comprising 
expression signals which are recognized by said host strain and direct the expression of said 
fusion protein, and recovering the fusion protein so produced. 

8. The method of claim 7, wherein said host strain is a myeloma cell line which 
produces immunoglobulin light chains, and said fusion protein comprises an immunoglobulin 
heavy chain belonging to the class IgM, IgG1 or IgG3, and wherein an 
immunoglobulin-like molecule comprising said fusion protein is produced. 

9. A method for the detection of HIV or SIV gp120 in a sample, characterized by 

(a) contacting a sample suspected of containing HIV or SIV gp120 with a 
fusion protein produced by a method according to claim 7 or 8, and 

(b) detecting whether or not a complex is formed, said fusion protein being detectably 
labeled. 

10. The method according to claim 9, wherein an immunoglobulin-like molecule further comprises 
a therapeutic agent, a detectable label, or an NMR imaging agent, linked to said 
immunoglobulin-like molecule. 